Test and model for inflammatory disease

ABSTRACT

The present invention relates to a means and methods for determining susceptibility to SEEK1 mediated diseases, such as psoriasis. In addition, there is provided polynucleotides encoding the SEEK1 protein having one or more nucleotide insertions, deletions or substitutions at one or novel positions, and the SEEK1 protein having one or more amino acid insertions, deletions and substitutions. Host cells and transgenics non-human animals comprising polynucleotides or proteins of the invention are also provided. Methods of screening for agents for use in treating SEEK1 mediated disease are also provided.

[0001] The present invention relates to polymorphisms in the SEEK1 gene and protein, and the exploitation of these polymorphisms in the detection and/or treatment of SEEK1 mediated disease, for example inflammatory diseases including psoriasis. The present invention also relates to polynucleotides encoding the SEEK1 protein, and having one or more nucleotide polymorphisms, and to a protein encoded by said polynucleotides. Also provided are transgenic non-human animals comprising the polynucleotides of the present invention; and methods and kits for treating, diagnosing or determining susceptibility to SEEK1 mediated disease, in particular by way of gene therapy.

[0002] In recent years, it has been recognised that there is considerable genetic diversity in human populations, with common polymorphisms occurring on average at least every kilobase in the genome. Polymorphisms which affect gene expression or activity of the encoded gene product may account for susceptibility to, or expression of, disease conditions, either directly or through interaction with other genetic and environmental factors.

[0003] Understanding the molecular basis for disease, by sequencing the human genome and characterising polymorphisms, will enable the identification of those individuals at greatest risk of disease. This will allow the better matching of treatment and disease, and enable the production of new and improved targets for drugs. Screening and treatment of disease may also be better targeted to those in need, thus increasing the cost-effectiveness of health-care provision.

[0004] One area in need of such approaches is the diagnosis and treatment of inflammatory diseases. Inflammation, which can be broadly defined as the destructive sequelae to activation of elements of the body's immune system, is a feature of many diseases including infection, autoimmune disorders and benign and malignant hyperplasia. The identification of genetic factors which influence susceptibility to such disorders will provide important new insights into inflammatory disease, and may yield important new diagnostic and/or prognostic tests and treatments.

[0005] Psoriasis is a chronic inflammatory cutaneous disorder which affects approximately 2% of the population in the UK and US, and causes varying degrees of physical discomfort, pain and disability. Psoriasis manifests itself as red scaly skin patches, principally on the scalp, elbows and knees, and is caused by epidermal hyperproliferation, and abnormal differentiation and infiltration of inflammatory cells. Psoriasis may also be associated with other inflammatory diseases such as arthritis, Crohn's disease, and HIV infection. Population, family, and twin studies all suggest an important genetic component in the pathogenesis of psoriasis, coupled with environmental triggers such as streptococcal infection and stress.

[0006] Psoriasis is one of a number of autoimmune diseases that display significant human leukocyte antigen (HLA) associations. The analysis of population-specific HLA haplotypes has provided evidence that susceptibility to psoriasis is linked to the class I and II major histocompatibility complexes (MHC) on human chromosome 6. These studies show that psoriasis consists of two distinct disease subtypes (Type I and Type II), which differ in age of onset and in the frequency of HLA types. Type I psoriasis has an age of onset of prior to 40 years and HLA types Cw6, B57, and DR7 are strongly increased. Patients with Type I psoriasis are much more likely to have a positive family history for the disease. In contrast, only about 10% of Cw6-positive individuals develop Type II psoriasis disease, with HLA-Cw2 being over-represented in this group.

[0007] Linkage analysis and association studies suggest the presence of a major genetic determinant of psoriasis within the MHC, the strongest candidate gene marker being HLA-C. The most significant association has been shown between HLA-Cw6 and disease Type IA, which has the earliest onset of disease at 0 to 20 years. However, specific involvement of the HLA-Cw6 genotype in disease pathogenesis has yet to be established. At present, the causes of psoriasis are unknown. There is no specific test for psoriasis or susceptibility thereto, and diagnosis is based solely on clinical examination and skin histopathology.

[0008] The present invention aims to overcome or ameliorate previous limitations in the art by providing means and methods for the detection and treatment of individuals having, or being susceptible to inflammatory diseases such as psoriasis.

[0009] Thus, in a first aspect of the present invention, there is provided a method of diagnosing, or determining susceptibility of a subject to, inflammatory disease such as psoriasis, the method comprising determining the presence of one or more polymorphisms in the SEEK1 gene or protein. The method may be used to identify the presence of a combination of polymorphisms in a subject which define a haplotype linked to inflammatory disease. The haplotype may be any particular combination of the polymorphisms, optionally including known polymorphisms.

[0010] The present invention is based upon the realisation that SEEK1 is involved in epidermal differentiation, and the gene is involved in determining onset of inflammatory disease. SEEK1 is expressed in skin, in particular keratinocytes. The SEEK1 gene is located approximately 160 kb telomeric of the HLA-C locus, in a cluster of non-HLA genes. This gene cluster, termed the MHC epidermal gene cluster MHC-EGC), spans approximately 50 kb genomic DNA and contains 5 genes, HCR, SPR1, CDSN, STG and SEEK1. SEEK1 is transcribed in the opposite orientation to the other four genes. The SEEK1 gene consists of 6 exons spanning approximately 24.8 kb of genomic DNA sequence. A SEEK1 mRNA transcript of 861 bp has been reported (Genbank accession AB031479) producing a predicted peptide 152 amino acids in length, which is rich in proline and serine residues, a characteristic feature of proteins involved in epidermal differentiation (South et al. (1999) J. Invest. Dermat. 112:910-918). EST sequences with homology to SEEK1 are reported to have been isolated from cDNA libraries synthesised from colon, uterus, ovary, testis and breast tissues (Genbank accession numbers AI343394, AI339603, AA127234, AI208110, AI379146, R50247, AA045454, AI243345, BE042193). Unlike HCR, SPR1, CDSN and STG, no mouse orthologue of the SEEK1 gene has been identified.

[0011] In the present text, and according to the present invention, the SEEK1 gene is that of GenBank Accession No. AP000510, which includes the 5′ promoter sequences, coding and non-coding exonic sequences, intronic sequences and 3′ untranslated sequences, all present on the MHC region of chromosome 6p21.3. The mRNA clone of SEEK1 (GenBank Accession No. AB031479) is shown in FIG. 1. A consensus genomic DNA sequence for SEEK1 is set out in FIG. 2.

[0012] A polymorphism is typically defined as two or more alternative sequences, or alleles, of a gene or protein in a population. A polymorphic site is the location at which divergence in sequence occurs. Examples of the ways in which polymorphisms are manifested include restriction fragment length polymorphisms, variable number of tandem repeats, hypervariable regions, minisatellites, di- or multi-nucleotide repeats, insertion elements and nucleotide or amino acid deletions, additions or substitutions. The first identified allele is usually referred to as the reference allele, or the wild type. Additional alleles are usually designated alternative or variant alleles. Herein, the sequence detailed in GenBank Accession No AP000510, the SEEK1 consensus genomic DNA sequence or FIG. 1 are designated the reference sequence. The Genbank sequence AP000510 and FIG. 1 are not part of the invention. Nucleic acid sequences which differ from the sequence of AP000510, the consensus sequence herein, or FIG. 1 at one or more positions may be referred to as variants.

[0013] A single nucleotide polymorphism is a variation in sequence between alleles at a site occupied by a single nucleotide residue. Single nucleotide polymorphisms (SNP's) arise from the substitution, deletion or insertion of a nucleotide residue at a polymorphic site. Typically, this results in the site of the variant sequence being occupied by any base other than the reference base. For example, where the reference sequence contains a “T” base at a polymorphic site, a variant may contain a “C”, “G” or “A” at that site. Single nucleotide polymorphisms may result in corresponding changes to the amino acid sequence. For example, substitution of a nucleotide residue may change the codon, resulting in an amino acid change. Similarly, the deletion or insertion of three consecutive bases in the nucleic acid sequence may result in the insertion or deletion of an amino acid residue. For ease of reference, where a single nucleotide polymorphism of the present invention results in the insertion or deletion of a nucleotide or amino acid residue, the numbering system of FIG. 1, the consensus sequence herein and AP000510 have been maintained.

[0014] The single nucleotide polymorphisms of the present invention which occur within the protein coding sequence may contribute to the phenotype of an organism by affecting protein structure or function. The effect may be neutral, beneficial or detrimental, depending upon the circumstances. Whatever the effect, the identification of such polymorphisms enables for the first time determination of susceptibility to disease, and new methods of treatment. The single nucleotide polymorphisms of the invention which occur in the non-coding 5′ or 3′ untranslated regions, may not affect protein sequence, but may exert phenotypic effects by RNA transcription, processing and/or translation. A polymorphism may affect more than one phenotypic trait or may be related to a specific phenotype. In the present invention, polymorphisms in the SEEK1 gene are likely to affect the phenotype of an individual with respect to SEEK1 mediated disease, such as inflammatory disease, in particular psoriasis.

[0015] In a preferred embodiment, the present invention provides a method of diagnosing or determining susceptibility to SEEK1 mediated disease, said method comprising determining the presence of a nucleotide substitution, deletion or insertion at one or more of positions 51814, 51789, 51759, 51570, 51505, 51462, 51265, 51216, 51124, 51078, 51017, 51008, 50920, 50901, 50801, 50049, 49405-49407, 49160, 49133, 49045, 49038, 49017, 48920, 48773, 47938, 47868, 47852, 47826, 47661, 47645, 47567, 47547, 47508, 47507, 47438, 46831, 46806, 46784, 39881, 39880, 39851, 39725, 39722, 39702, 35884, 35732, 27006, 26915, 26770, 26724, 26694 26684, 26675-26682, 26576, 26539, 25534, 25458 and 25449 of SEEK1 gene as represented by FIG. 2. Some of these positions correspond to positions: 16549, 16548, 16519, 16393, 16390, 16370, 12553, 12401, 3676, 3585, 3444, 3394, 3364, 3354, 3352, 3247, 3210, 2205, 2126, and 2120 of Genbank sequence AP000510. In the present invention, it is the position of the polymorphism which is the novel and limiting feature: the reference to the gene sequence simply confirms that the polymorphism is present in the SEEK1 gene or protein. The sequence need or may not be fully identical to that given in any one of the reference sequences. This applies both to gene and protein reference sequences. The relationship between the positions in the consensus sequence of FIG. 2 and in the Genbank sequence AP000510 are as follows: SNP name, referring to SNP Number, referring to Genbank sequence AP000510 consensus sequence of FIG. 2 G16549A SNP39 G16548A SNP40 C16519T SNP41 A16393G SNP42 G16390A SNP43 G16370A SNP44 C12553T SNP45 G12401T SNP46 T3676C SNP47 A3585G SNP48 A3444G SNP49 C3394A SNP50 G3364A SNP51 G3354A SNP52 C3352Ins/Del[C] SNP53 C3352Ins/Del[C] SNP53 C3352Ins/Del[C] SNP53 A3247T SNP54 G3210A SNP55 C2205T SNP56 C2126Del[C] SNP57 C2120T SNP58

[0016] These novel polymorphisms in the SEEK1 gene, at the positions indicated above, have been identified as being involved in SEEK1 mediated disease. In particular, the polymorphisms of the present invention may be useful in identifying individuals being susceptible or resistant to SEEK1-mediated disease, and in the diagnosis or treatment of such conditions.

[0017] In this text, diseases in which SEEK1 is implicated in the pathology will be referred to as “SEEK1-mediated disease”. Such diseases include inflammatory disease such as psoriasis. In particular, the inflammatory disease is of the skin, most particularly skin psoriasis.

[0018] The single polymorphisms of the invention have each been given a positional reference with respect to the consensus sequence of FIG. 2 (see Table 1(iii)). Some of them also have a positional reference with respect to sequence of GenBank Accession No. AP000510 (see Table 1 (i), column 1). However, it should be noted that the native SEEK1 gene is transcribed in the opposite orientation to AP000510 and the consensus sequence. In addition, for ease of reference, polymorphisms occurring in the coding sequence of SEEK1 are also given a positional reference with respect to the SEEK1 mRNA sequence. The fragments of the SEEK1 gene comprising the polymorphisms (as shown in Table 1(i), column 3 and in Table 1(iii), column 3) are fragments of the sequence of GenBank Accession No AP000510. These fragments can be readily aligned with the genomic sequence of GenBank Accession No. AP000510, or other clones of this region, using methods known to the person skilled in the art, for example by comparing the nucleotide sequence of the fragment with the sequence of the MHC-EGC region by using computer programs such as DNASIS (Hitachi Engineering, Inc.) or Word Search or FASTA of the Genetic Computer Group (Madison, Wis.).

[0019] The present invention can also thus refer to the polymorphisms in the SEEK1 gene (coding and non-coding) and the SEEK1 protein (coding) by reference to the fragments as set out in Table 1(i) and Table 1(iii).

[0020] Any method, including those known to persons skilled in the art, may be used to determine which allele of one or more polymorphisms is present. Preferably, the method comprises first removing a sample from a subject. More preferably, the method comprises isolating from a sample a polynucleotide or protein to determine therein which allele of one or more polymorphisms of the invention is present. Any biological sample comprising cells containing nucleic acid or protein is suitable for this purpose. Examples of suitable samples include whole blood, semen, saliva, tears, buccal, skin or hair. For analysis of cDNA, mRNA or protein, the sample must come from a tissue in which the SEEK1 gene is expressed, and thus it is preferable to use skin samples.

[0021] Any method for determining alleles in a polynucleotide may be used, including those known to persons skilled in the art. One example of a widely-available technique is direct DNA sequencing of PCR products containing the polymorphism to be tested. However, and preferably, the method may comprise the use of anti-sense polynucleotides, such as those of the present invention, as defined below. Such polynucleotides may include sequences which are able to distinguish between alleles of one or more polymorphisms, by preferential binding, and sequences which hybridise under stringent conditions to a region either side of a polymorphism to enable amplification of one or more of the polymorphisms.

[0022] Methods of this embodiment include those known to persons skilled in the art, for example, direct probing, allele specific hybridisation, and PCR-based methods including sequencing of PCR products, Allele Specific Amplification (ASA), RFLP, single base extension and rolling circle amplification following allele-specific ligation.

[0023] Determination of an allele of a polymorphism using direct probing involves the use of anti-sense sequences. These may be prepared synthetically or by nick translation. The anti-sense probes may be suitably labelled using, for example, a radiolabel, enzyme label, fluoro-label, biotin-avidin label for subsequent visualization in, for example, a southern blot procedure. A labelled probe may be reacted with a sample DNA or RNA, and the areas of the DNA or RNA which carry complimentary sequences will hybridise to the probe, and become labelled themselves. The labelled areas may then be visualized, for example by autoradiography.

[0024] Preferably, the method may first comprise the amplification of a region of the SEEK1 gene containing one or more of the polymorphic sites of the invention, for example, using PCR techniques. Probes of the present invention may be useful for this purpose.

[0025] The above described methods may require amplification of the DNA sample from the subject, and this can be done by techniques known in the art, such as PCR (see PCR Technology: Principles and Applications for DNA Amplification (ed. H. A. Erlich, Freeman Press, NY 1992; PCR Protocols: A Guide to methods and Applications (eds. Innis et al., Academic press, San Diego, Calif. 1990); Mattila et al., Nucleic Acids Res. 19 4967 (1991); Eckert et al., PCR Methods and Applications 117 (1991) and U.S. Pat. No. 4,683,202. Other suitable amplification methods include ligase chain reaction (LCR) (Wu et al., Genomics 4 560 (1989); Landegran et al., Science 241 1077 (1988)), transcription amplification (Kwoh et al., Proc Natl Acad Sci USA 86 1173 (1989)), self sustained sequence replication (Guatelli et al., Proc Natl Acad Sci USA 87 1874 (1990)) and nucleic acid based sequence amplification (NASBA). The latter two methods both involve isothermal reactions based on isothermal transcription which produce both single stranded RNA and double stranded DNA as the amplification products, in a ratio of 30 or 100 to 1, respectively.

[0026] It may often be desirable to identify the presence of multiple single nucleotide polymorphisms in a sample from a subject. This may be the case in the present invention where the SEEK1 gene contains at least 21 polymorphisms, each of which may be indicative of a different phenotype of inflammatory disease. For this purpose, nucleic acid arrays may be useful, as described in WO95/11995. The array may contain a number of probes, each designed to identify one or more of the above single nucleotide polymorphisms of the SEEK1 gene, as described in WO95/11995.

[0027] In a preferred embodiment of the first aspect, there is provided a method for diagnosing or determining susceptibility to SEEK1 mediated disease, the method comprising determining the presence of an amino acid substitution, deletion or insertion at one or more of positions 24, 34, 37, 40 or 133 of the SEEK1 amino acid sequence, represented by FIG. 3(i), or the presence of a protein fragment having the amino acid sequence as represented by FIG. 3(ii) or 3(iii). Any method for determining the presence of a particular form, or allele, of a protein is present, may be used. One such method involves the use of antibodies in diagnosing or determining susceptibility to SEEK1 mediated disease. The method may comprise removing a sample from a subject, contacting the sample with an antibody to an antigen of a SEEK1 protein or protein fragment and detecting binding of the antibody to the antigen, wherein binding is indicative of the presence of a particular allele or form of the protein and thus risk to SEEK1 mediated disease. Tissue samples as described above are suitable for this method.

[0028] The detection of binding of the antibody to the antigen in a sample may be assisted by methods known in the art, such as the use of a secondary antibody which binds to the first antibody, or a ligand. Immunoassays including immunofluorescence assays (IFA) and enzyme linked immunosorbent assays (ELISA) and immunoblotting may be used to detect the presence of the antigen. For example, where ELISA is used, the method may comprise binding the antibody to a substrate, contacting the bound antibody with the sample containing the antigen, contacting the above with a second antibody bound to a detectable moiety (typically an enzyme such as horse radish peroxidase or alkaline phosphatase), contacting the above with a substrate for the enzyme, and finally observing the colour change which is indicative of the presence of the antigen in the sample.

[0029] In a second aspect of the present invention, there is provided an isolated or recombinant polynucleotide comprising a nucleic acid sequence encoding the SEEK1 gene as represented by the consensus sequence of FIG. 2, of AP000510, wherein the nucleic acid sequence comprises a nucleotide substitution, deletion or insertion at one or more of positions 51814, 51789, 51759, 51570, 51505, 51462, 51265, 51216, 51124, 51078, 51017, 51008, 50920, 50901, 50801, 50049, 49405-49407, 49160, 49133, 49045, 49038, 49017, 48920, 48773, 47938, 47868, 47852, 47826, 47661, 47645, 47567, 47547, 47508, 47507, 47438, 46831, 46806, 46784, 39881, 39880, 39851, 39725, 39722, 39702, 35884, 35732, 27006, 26915, 26770, 26724, 26694 26684, 26675-26682, 26576, 26539, 25534, 25458 and 25449 of FIG. 2.

[0030] The polynucleotide of this invention is preferably DNA, or may be RNA or other options.

[0031] As discussed above, where a single nucleotide polymorphism of the present invention comprises a nucleotide substitution, the substitution may comprise the replacement of the reference base at a polymorphic site with any other base. Each nucleic acid sequence of Table 1(i), column 3 and Table 1(iii), column 3 comprising a single nucleotide polymorphism represents a preferred embodiment of the invention.

[0032] It will be appreciated by those skilled in the art that SEEK1 gene sequences of the invention may comprise one or more nucleotide substitutions, deletions or insertions in addition to one or more of the single nucleotide polymorphisms of the invention.

[0033] In a third aspect, fragments of the above polynucleotides are provided, which comprise one or more nucleotide substitutions, insertions or deletions at one or more of the above mentioned positions of the SEEK1 gene, as represented by consensus sequence. Preferably, a fragment may comprise, or even consist of, the polynucleotide sequence of Table 1 (i), column 3 or Table (iii), column 3. The novelty of a fragment according to the present embodiment may be easily ascertained by comparing the nucleotide sequence of a fragment with sequences catalogued in databases such as GenBank, or by using computer programs such as DNASIS (Hitachi Engineering, Inc.) or Word Search or FASTA of the Genetic Computer Group (Madison, Wis.).

[0034] Preferably, the fragments do not encode a full length protein, as is generally the case with the aforementioned polynucleotides of the second aspect, but otherwise satisfy the requirements of the second aspect. Preferred fragments may be 10 to 150 nucleotides in length. More preferably, the fragments are between 5 to10, 5 to 20, 10 to 20, 20 to 50, or 50 to 100 nucleotides in length. For example, the fragments may be 5, 8, 10, 12, 15, 18, 20, 22, 25, 28, 30, or 35 nucleotides in length. The fragments may be useful in a variety of diagnostic, prognostic or therapeutic methods, or may be useful as research tools for example in drug screening.

[0035] In a fourth aspect of the invention, there is provided non-coding, complementary sequences which hybridise to the SEEK1 gene sequence. Such “anti-sense” sequences are useful as probes or primers for detecting an allele of a polymorphism of the invention, for example in the methods of the first aspect or in the regulation of the SEEK1 gene. They may also be used as agents for use in the identification and/or treatment of individuals having or being susceptible to SEEK1 mediated disease.

[0036] The anti-sense sequences of the invention include those which hybridise to an allele of a polymorphism of the invention, and also those which hybridise a region flanking the polymorphic site to enable amplification of an allele of one or more polymorphisms. Preferred anti-sense sequences are the complements of the sequences shown in Table 1(i) column 3 or Table 1(iii), column 3, or more preferably the complement of the sequence upstream and downstream of, and including, the polymorphism the anti-sense sequences may be useful as probes or primers. To be useful as a probe, the anti-sense sequence should bind preferentially one allele of one or more polymorphisms of the present invention and will, preferably, comprise the exact complement of one allele of one or more polymorphisms of the invention. Thus, for example, where the variant comprises a “G” residue at position 16549 of AP000510 (the same as SNP39 of the consensus sequence of FIG. 2), it is preferred that the anti-sense sequence will comprise a “C” residue. Such anti-sense sequences which are capable of specific hybridisation to detect a single base mis-match may be designed according to methods known in the art and described in Maniatis et al., Molecular Cloning: A Laboratory Manual 2^(nd) Edition (1989), Cold Spring Harbor, N.Y. and Berger et al., Methods in Enzymology 152: Guide to Molecular Cloning Techniques (1987) Academic Press Inc. San Diego, Calif., Gibbs et al., Nuc Acids Res., 17: 2437 (1989); Kwok et al., Nucl Acids Res 18: 999; and Miyada et al., Methods Enzymol. 154: 94 (1987). Variation in the sequence of these anti-sense sequence is acceptable for the purposes of the present invention, provided that the ability of the anti-sense sequence to distinguish between alleles of a polymorphism is not compromised. Similarly, variation in the sequence of a primer sequence is acceptable, provided its ability to mediate amplification of a polymorphic site is not compromised. Preferably, a primer sequence will hybridise to the SEEK1 gene under stringent conditions which are defined below.

[0037] In relation to the present invention, “stringent conditions” refers to the washing conditions used in a hybridisation protocol. In general, the washing conditions should be a combination of temperature and salt concentration so that the denaturation temperature is approximately 5 to 20° C. below the calculated T_(m) of the nucleic acid under study. The T_(m) of a nucleic acid probe of 20 bases or less is calculated under standard conditions (1M NaCl) as [4° C.×(G+C)+2° C.×(A+T)], according to Wallace rules for short oligonucleotides. For longer DNA fragments, the nearest neighbour method, which combines solid thermodynamics and experimental data may be used, according to the principles set out in Breslauer et al., PNAS 83: 3746-3750 (1986).

[0038] The optimum salt and temperature conditions for hybridisation may be readily determined in preliminary experiments in which DNA samples immobilised on filters are hybridised to the probe of interest and then washed under conditions of different stringencies. While the conditions for PCR may differ from the standard conditions, the T_(m) may be used as a guide for the expected relative stability of the primers. For short primers of approximately 14 nucleotides, low annealing temperatures of around 44° C. to 50° C. are used. The temperature may be higher depending upon the base composition of the primer sequence used.

[0039] The anti-sense polynucleotides of this embodiment may be the full length of the SEEK1 gene as represented by AP000510 or the consensus sequence of FIG. 2, or more preferably may be 5 to 200 nucleotides in length. Preferred polynucleotides are 5 to 10, 10 to 20, 20 to 50, 50 to 100 or 100 to 200 nucleotides in length. Primers, in particular, are typically 10 to 15 nucleotides long, and may occasionally be 16 to 25.

[0040] In a preferred embodiment, the polynucleotides of the aforementioned aspects of the invention may be in the form of a vector, to enable the in vitro or in vivo expression of the polynucleotide sequence. The polynucleotides may be operably linked to one or more regulatory elements including a promoter; regions upstream or downstream of a promoter such as enhancers which regulate the activity of the promoter; an origin of replication; appropriate restriction sites to enable cloning of inserts adjacent to the polynucleotide sequence; markers, for example antibiotic resistance genes; ribosome binding sites: RNA splice sites and transcription termination regions; polymerisation sites; or any other element which may facilitate the cloning and/or expression of the polynucleotide sequence. Where two or more polynucleotides of the invention are introduced into the same vector, each may be controlled by its own regulatory sequences, or all sequences may be controlled by the same regulatory sequences. In the same manner, each sequence may comprise a 3′ polyadenylation site. The vectors may be introduced into microbial, yeast or animal DNA, either chromosomal or mitochondrial, or may exist independently as plasmids. Examples of suitable vectors will be known to persons skilled in the art and include pBluescript II, LambdaZap, and pCMV-Script (Stratagene Cloning Systems, La Jolla (USA))

[0041] Appropriate regulatory elements, in particular, promoters will usually depend upon the host cell into which the expression vector is to be inserted. Where microbial host cells are used, promoters such as the lactose promoter system, tryptophan (Trp) promoter system, β-lactamase promoter system or phage lambda promoter system are suitable. Where yeast cells are used, preferred promoters include alcohol dehydrogenase I or glycolytic promoters. In mammalian host cells, preferred promoters are those derived from immunoglobulin genes, SV40, Adenovirus, Bovine Papilloma virus etc. Suitable promoters for use in various host cells would be readily apparent to a person skilled in the art (See, for example, Current Protocols in Molecular Biology Edited by Ausubel et al, published by Wiley).

[0042] In a fifth aspect of the present invention there is provided a protein or protein fragment comprising an amino acid substitution, deletion or insertion at one or more of positions 24, 34, 37, 40 or 133 of the amino acid SEEK1 sequence as represented by FIG. 3(i), or a SEEK1 protein fragment having the amino, acid sequence represented by FIG. 3(ii) or 3(iii). Preferably, the protein or protein fragment is encoded by a polynucleotide according to the second aspect of the invention, and comprises a nucleotide insertion, deletion or substitution at one or more of positions 3394, 3364, 3354, 3352 and 2205 of AP000510 (corresponding to positions 26724, 26694, 26684, 26675-26682 and 25534 of FIG. 2). The SEEK1 protein or protein fragments of the invention may comprise one or more additional polymorphisms.

[0043] The amino acid sequence exactly as shown in FIG. 3(i) may be referred to as the reference sequence, and is not part of the invention. The amino acid sequence of FIG. 3(i) having an amino acid substitution, deletion or insertion at one or more of the positions indicated above may be referred to as a variant of FIG. 2(i). The reference amino acid at one or more of the above polymorphic sites may be replaced by any other amino acid residue to produce a variant sequence. Amino acid sequences of FIG. 2(i) having one or more of the polymorphisms disclosed in Table 1 (i) or Table (iii) are each preferred embodiments of the invention.

[0044] Protein fragments may be functional or non-functional and may be useful in drug screening or gene therapy. Functional fragments may be defined as those which have binding and/or immunological characteristics of the SEEK1 protein. The fragments maybe at least 10, preferably at least 15, 20, 25 30, 35, 40 or 50 amino acids in length.

[0045] In a sixth aspect of the present invention, there are provided antibodies which react with an antigen of a protein or protein fragment of the fifth aspect. A preferred antibody for use in the present invention is one which binds to the amino acid sequence: NH₂-Met-Ile-Ser-Lys-Glu-Phe-His-Leu-Ala-Ala-The-Gln-Asp-Asp-Lys-COOH. Antibodies can be made by the procedure set forth by standard procedures (Harlow and Lane, “Antibodies; A Laboratory Manual” Cold Spring Harbor Laboratory, Cold Spring Harbor, N.Y., 1998). Briefly, purified antigen can be injected into an animal in an amount and in intervals sufficient to elicit an immune response. Antibodies can either be purified directly, or spleen cells can be obtained from the animal. The cells are then fused with an immortal cell line and screened for antibody secretion. The antibodies can be used to screen DNA clone libraries for cells secreting the antigen. Those positive clones can then be sequenced as described in, for example, Kelly et al., Bio/Technology 10:163-167 (1992) and Bebbington et, al., Bio/Technology 10:169-175 (1992). The antibody may be specific for the amino acid sequence in question. Preferably, the antibody is sufficiently specific to distinguish between the reference SEEK1 protein and variants thereof, such as those of the fifth aspect. Such antibodies will have use in the first aspect of the invention. Preferably, the antigen being detected and/or used to generate a particular antibody will include proteins or protein fragments according to the fifth aspect.

[0046] In a seventh aspect of the present invention, there is provided host cell comprising a polynucleotide according to any of the aforementioned aspects, for expression of the polynucleotide. The host cell may comprise an expression vector, or naked DNA encoding said polynucleotides. A wide variety of suitable host cells are available, both eukaryotic and prokaryotic. Examples include bacteria such as E. coli, yeast, filamentous fungi, insect cells, mammalian cells, preferably immortalised, such as mouse, CHO, HeLa, myeloma or Jurkat cell lines, human and monkey cell lines and derivatives thereof. Such host cells are useful in drug screening systems to identify agents for use in diagnosis or treatment of individuals having, or being susceptible to SEEK1 mediated disease.

[0047] The method by which said polynucleotides are introduced into a host cell will usually depend upon the nature of both the vector/DNA and the target cell, and will include those known to a person skilled in the art. Suitable known methods include fusion, conjugation, transfection, transduction, electroporation or injection, as described in Sambrook et al.

[0048] In an eighth aspect of the present invention, there is provided a transgenic non-human animal comprising a polynucleotide according to an aforementioned aspect of the invention. Preferably, the transgenic, non-human animal comprises a polynucleotide according to the second or third aspects. Transgenic non-human animals are useful for the analysis of the single nucleotide polymorphisms and their phenotypic effect. Expression of a polynucleotide of the invention in a transgenic non-human animal is usually achieved by operably linking the polynucleotide to a promoter and/or enhancer sequence, preferably to produce a vector of the invention, and introducing this into an embryonic stem cell of a host animal by microinjection techniques (Hogan et al., A Laboratory Manual, Cold Spring harbour and Capecchi Science (1989) 244: 1288-1292). The transgene construct should then undergo homologous recombination with the endogenous gene of the host. Those embryonic stem cells comprising the desired polynucleotide sequence may be selected, usually by monitoring expression of a marker gene, and used to generate a non-human transgenic animal. Preferred host animals include mice and other rodents.

[0049] In a preferred embodiment, the transgenic non-human animal may comprise an anti-sense nucleic acid sequence of the fourth aspect. The expression of an anti-sense sequence in a transgenic non-human animal may be useful in determining the effects of such sequences in treating SEEK1-mediated disease, or in neutralising deleterious effects of variant SEEK1 genes in an animal. Preferably, the host animal will be one which suffers from SEEK1 mediated disease. The disease may be naturally occurring or artificially introduced.

[0050] In some preferred embodiments, for example where the mediated disease has been artificially induced, the transgenic non-human animal will be modulated to no longer expresses the endogenous SEEK1 gene. Such animals may be referred to as “knock out”. In some cases, it may be appropriate to modulate, the expression of the endogenous SEEK1 gene, or express the polynucleotides of the present invention, in specific tissues. This approach removes viability problems if the expression of a gene is abolished or induced in all tissues. Preferably, the specific tissue would be skin.

[0051] In a ninth aspect of the present invention there is provided a method of screening for agents for use in the prognosis, diagnosis or treatment of individuals having, or being susceptible to, SEEK1 mediated disease, said method comprising contacting a putative agent with a polynucleotide or protein according to an aforementioned aspect of the present invention, and monitoring the reaction there between. Preferably, the method further comprises contacting a putative agent with a reference polynucleotide (or fragment thereof as described above) or protein of the consensus sequence in FIG. 2 and FIG. 3(i) respectively, and comparing the reaction between (i) the agent and the reference polynucleotide or protein and (ii) the agent and polynucleotide or protein of the invention. Potential agents are those which react differently with a variant of the invention and a reference allele. It is envisaged that the present method may be carried out by contacting a putative agent with a host cell or transgenic non-human animal comprising a polynucleotide or protein according to the invention. Putative agents will include those known to persons skilled in the art, and include chemical or biological compounds, such as anti-sense polynucleotide sequences, complementary to the coding sequences of the second aspect, or polyclonal or monoclonal antibodies which bind to a product such as a protein or protein fragment of the fifth aspect. The agents identified in the present method may be useful in determining susceptibility to SEEK1 mediated disease, or in the diagnosis, prognosis or treatment of said disease.

[0052] In a tenth aspect of the invention, there is provided a method for diagnosing and treating SEEK1 mediated disease in a subject, wherein the method comprises

[0053] (i) determining which allele of one or more of the polymorphisms of the invention is present; and

[0054] (ii) introducing into the subject a different allele;

[0055] wherein either a variant allele of the present invention is determined, and/or a variant allele of the present invention is introduced into the subject.

[0056] In an alternative embodiment of this aspect, there is provided the use of an allele of one or more polymorphisms of the invention in the manufacture of a medicament for use in the diagnosis and treatment of SEEK1 mediated disease, wherein the method comprises

[0057] (i) determining which allele of one or more polymorphisms of the invention are present; and

[0058] (ii) introducing into the subject a different allele;

[0059] wherein either a variant allele of the present invention is determined, and/or a variant allele of the present invention is introduced into the subject. The medicament may therefore comprise either the variant allele of one or more polymorphisms or the reference allele. In the present invention, treatment includes amelioration of disease.

[0060] This method of diagnosis and treatment may comprise determining and introducing alleles in the form of a polynucleotide or protein. In the above embodiments, the allele of a polymorphism may be determined using any method, such as those of the first aspect discussed above. The other allele may be introduced in the form of a protein, or polynucleotide. Any suitable means for introduction of a protein may be used. Introduction of a polynucleotide may use gene therapy methods including those known in the art. In general, a polynucleotide encoding the allele will be introduced into the target cells of a subject, usually in the form of a vector and preferably in the form of a pharmaceutically acceptable carrier. Any suitable delivery vehicle may be used, including viral vectors, such as retroviral vector systems which can package a recombinant genome. The retrovirus could then be used to infect and deliver the polynucleotide to the target cells. Other delivery techniques are also widely available, including the use of adenoviral vectors, adeno-associated vectors, lentiviral vectors, pseudotyped retroviral vectors and pox or vaccinia virus vectors. Liposomes may also be used, including commercially available liposome preparations such as Lipofectin® Lipofectamine®, (GIBCO-BRL, Inc. Gaitherburg, Md.), Superfect® (Qiagen Inc, Hilden, Germany) and Transfectam® (Promega Biotec Inc, Madison Wis.).

[0061] The polynucleotide or vehicle may be administered parenterally (eg, intravenously), transdermally, by intramuscular injection, topically or the like. As SEEK1 mediated diseases are usually manifested in the skin, topical administration is preferred. The exact amount of polynucleotide or vehicle to be administered will vary from subject to subject and will depend upon age, weight, general condition, and severity or mechanism of the disorder.

[0062] In a further aspect, the present invention provides a kit for the detection in a subject of a single nucleotide polymorphism according to the present invention. Preferably, the kit will contain polynucleotides according to the aforementioned aspects, most preferably the anti-sense sequences of the third aspect for use as probes or primers; antibodies of the sixth aspect; or restriction enzymes for use in detecting the presence of a polynucleotide, protein or protein fragment of the invention. Preferably, the kit will also comprise means for detection of a reaction, such as nucleotide label detection means, labelled secondary antibodies or size detection means. In yet a further preferred embodiment, the polynucleotides, or antibodies may be fixed to a substrate, for example an array, as described in WO95/11995.

[0063] The preferred embodiments of each aspect apply to the other aspects of the invention, mutatis mutandis.

[0064] The present invention will now be described by way of a non-limiting example, with reference to the following figures in which:

[0065]FIG. 1 shows the nucleotide sequence of the mRNA clone of the SEEK1 gene, of GenBank Accession No. AB031479. On this figure are indicated:

[0066] The start and stop codons, atg and taa respectively, underlined at positions 275nt and 731nt.

[0067] The sites of amino acid polymorphisms (in bold shading) at positions 24, 34, 37 and 133.

[0068] The run of polymorphic cytosine residues (in bold shading) starting at nucleotide position 386.

[0069]FIG. 2 shows a consensus genomic DNA sequence of SEEK1.

[0070]FIG. 3(i) shows the amino acid sequence of the reference SEEK1 protein.

[0071]FIG. 3(ii) shows the amino acid sequence of the variant SEEK1 protein with polymorphism SEEK1C3352Del[C]₆ and coding sequence therefor. The deletion of a cytosine residue causes a frame shift mutation and premature truncation of the predicted protein—the altered amino acids, relative to the wild type sequence, are shaded.

[0072]FIG. 3(iii) shows the amino acid sequence of the variant SEEK1 protein with polymorphism SEEK1C3352Ins[C]₈ and coding sequence therefor. The insertion of a cytosine residue causes a frame shift mutation and premature truncation of the predicted protein—the altered amino acids, relative to the wild type sequence, are shaded.

[0073]FIG. 4 shows the deduced exonic sequences of SEEK1.

[0074]FIG. 5 shows the western blot of epidermal proteins probed with rabbit anti-1gG.

[0075]FIG. 6 shows a western blot of proteins of normal human skin cells probed with anti-SEEK1 peptide 590 serum.

EXAMPLES Determination of Gene Structure

[0076] The mRNA sequence of the SEEK1 gene (GenBank Accession ID AB031479) was used to screen the following public DNA databases: (available through the National Centre for Biotechnology Information website—http://www.ncbi.nlm.nih.gov/); NR (Non-Redundant DNA), HTGS (High Throughput Genomic Sequence), dbEST (Expressed Sequence Tag) and GSS (Genome Survey Sequence). The analysis was performed using the BLASTN algorithm (Altschul, et al., (1990) J. Mol. Biol. 215:403-410). Any genomic sequences containing the SEEK1 gene were identified by their degree of sequence identity. The gene structure was determined by comparison of the mRNA sequence with the genomic clones. The deduced exon-intron organisation of the SEEK1 gene is presented in FIG. 4. The exon locations in the consensus are as folows: SEEK Gene Consensus Genomic DNA Position bp Exon 1 50586-50541 Exon 2 39754-39595 Exon 3 35841-35765 Exon 4 26978-26948 Exon 5 26751-26627 Exon 6 25765-25345

[0077] Oligonucleotide Primer Design for SEEK1 Gene Sequencing

[0078] 6 pairs of oligonucleotide primers (Seek1×2F/Seek1×2R; Seek1×3F/Seek1×3R; Seek1×4/5F/Seek1×4/5R; Seek1×6F/Seek1×6R; Seek1×5F/Seek1×5R; Seek1×4F/Seek1×4R—Table 1) were designed to amplify exons 1 to 6 of the SEEK1 gene. Oligonucleotide primer sequences were derived from human chromosome 6p21 genomic DNA sequence (GenBank Accession AP000510). !Oligo Name? Sequence 5′ to 3′ Seek1x2 F DAGGTGTTCCGAACATCTCTGC Seek1x2 R DACAGCCTGGACACATTCTTCC Seek1x3 F DAAGACAGCCTGTTTGAGTGC Seek1x3 R DTGTATCCCTTCCTTCTCTCC Seek1x4/5 F DAAAGGTAAGAGGTGGTGAGG Seek1x4/5 R DATCTGGCTCACCAGAAATGG Seek1x6 F DTTTCTAAACCTGGGATGCAGC Seek1x6 R DAGATGAGATCACGCCATTGC Seek1x4 F DATGCCTGTAAAGGAGGAAGG Seek1x4 R DAAAGTGGGTCAAGTGAACGG Seek1x5 F DTAAGCTCCATCCACCCCTGG Seek1x5 R DAACTGGACGCATGGGGTTGG Seek1x6 F2 DATGGGATCCAGGCATCCTGC Seek1x6 R2 DTTTGGACAGGGTGTGGAGGG

[0079] SEEK1 Gene Amplification

[0080] Genomic DNA from a panel of 24 unrelated individuals was amplified using primer pairs (Seek1×2F/Seek1×2R; Seek1×3F/Seek1×3R; Seek1×4/5F/Seek1×4/5R; Seek1×6F/Seek1×6R; Seek1×5F/Seek1×5R; Seek1×4F/Seek1×4R—Table 1). 100 ng genomic DNA was amplified by PCR in a total reaction volume of 25 μl containing 50 mM KCl, 20 mM Tris.HCl (pH 8.4), 2 mM MgCl₂ 200 μM each dATP, dCTP, dGTP, dTTP, 1 μM each oligonucleotide primer and 0.5 units AmpliTaq Gold DNA polymerase (Applied Biosystems). Reactions were thermocycled with an initial denaturation step of 95° C./10 mins followed by 35 cycles of 94° C./30 secs; T_(m) annealing/30 secs; 72° C./30 secs. A final elongation step of 72° C./10 mins completed the amplification. TABLE 2 Primers and Amplimer Sizes. Fragment Forward Primer Reverse Primer Product size (bp) Exon 2 Seek1x2 F Seek1x2 R 402 Exon 3 Seek1x3 F Seek1x3 R 294 Exon 4 and 5 Seek1x4/5 F Seek1x4/5 R 531 Exon 6 Seek1x6 F Seek1x6 R 627 Exon 5 Seek1x5 F Seek1x5 R 340 Exon 4 Seek1x4 F Seek1x4 R 241

[0081] Heteroduplex Analysis Using DHPLC:

[0082] Oligos were designed to amplify products of between 241-627 bp in length from the genomic DNA of 24 individuals. Denaturing high-performance liquid chromatography (DHPLC) analysis was performed using the WAVE™ DNA fragment analysis system (Transgenomic) (Kuklin, et al., (1997-98) Genet Test. 1(3): 201-6). The temperature required for successful resolution of heteroduplex molecules within each PCR product was determined empirically by injecting PCR product at a series of increasing mobile phase temperatures and constructing a fragment specific melting curve. A universal gradient for double stranded DNA was used to determine the appropriate acetonitrile concentration for the heteroduplex identification. For mutation detection, 1-2 μl aliquots of the PCR reactions from each of the eleven individuals were injected onto the WAVE™ column. Mutation detection gradients were for four minutes. Results were graphically visualised using the D-7000 HSM software (Transgenomic).

[0083] Direct Sequencing of PCR Products

[0084] To define the exact nature of the polymorphisms identified by DHPLC heteroduplex analysis, 50-100 ng of PCR products were sequenced in both orientations using the DYEnamic ET terminator cycle sequencing premix kit from Amersham. Reactions were fractionated on ABI 377 automated sequencers using standard procedures. Chromatographic traces were analysed using the SEQUENCHER programme (Gene Codes, USA), to identify SNP positions.

[0085] The single nucleotide polymorphisms of the SEEK1 gene, including those of the present invention, are listed in Tables 1 (i) and (ii) where:

[0086] Column 1 of (i) provides the name and positional reference of the polymorphism with respect to the reference genomic DNA sequence AP000510, together with details of the polymorphism itself For example, the reference “G16549A” indicates a substitution of the nucleotide “G” for nucleotide “A” at position 16549 of AP000515.

[0087] Column 2 of (i) provides the positional reference of the polymorphism with respect to the reference mRNA sequence AB031479.

[0088] Column 3 of (i) shows the sequence flanking the polymorphism, the polymorphism itself being shown in underlined type.

[0089] Column 4 of (i) shows the IUB code of each single nucleotide polymorphism.

[0090] Columns 5 and 6 of (i) shows the effect of each single nucleotide polymorphism on the amino acid sequence of SEEK1.

[0091] Columns 2 and 3 of (ii) show the forward and reverse primers which may be used to amplify a region of the SEEK1 gene to enable detection of the single nucleotide polymorphisms detailed in Column 1 of (ii).

[0092] Column 4 of (ii) shows the size in base pairs of the amplified products.

[0093] The single nucleotide polymorphisms of the SEEK1 gene, as set out in Tables (i) and (ii) are also listed in Table 3(iii) which includes additional single nucleotide polymorphisms. In Table 3:

[0094] Column 1 provides SNP number.

[0095] Column 2 provides the SNP location in the SEEK1 gene.

[0096] Column 3 shows the flanking sequence of the polymorphism, the polymorphism itself being underlined.

[0097] Column 4 shows the IUB code of each single nucleotide polymorphism.

[0098] Column 5 shows the nucleotide position in AB031479 (see FIG. 1).

[0099] Column 6 shows any resulting amino acid polymorphism.

[0100] Column 7 shows the nucleotide position in the consensus genomic DNA sequence (FIG. 2).

[0101] Columns 8 and 9 show statistical significance of the polymorphism association.

Detection of Polymorphisms in 24 Population Controls

[0102] Allele frequencies of the SEEK1C3352Ins/Del[C] polymorphism were determined in 24 population controls by direct DNA sequencing of PCR products generated using primers Seek1×5F and Seek1×5R.

[0103] SEEK1C3352Del [C]₆-16.5%

[0104] SEEK1C3352WT[C]₇-74.0%

[0105] SEEK1C3352Ins [C]₈-9.50%

[0106] Production of an Antibody to SEEK1 Protein

[0107] The following peptide, termed 590-THY, was synthesised and coupled to thyroglobulin for SEEK1 anti-sera production.

[0108] NH₂-Met-Ile-Ser-Lys-Glu-Phe-His-Leu-Ala-Ala-Thr-Gln-Asp-Asp-Cys-COOH

[0109] Two rabbits were immunised with peptide 590-THY for polyclonal production. Two immunisations are performed at 4 weeks intervals with two sample bleeds for testing

[0110] ELISA

[0111] Sera from the two immunised rabbits and four mice were tested for their reactivity to the peptide 590. Each time the controls (rabbit and mice not immunised) were tested at different (serial) dilutions. The optimum dilution was approximately 1:10000. Control serum was negative.

[0112] Western Blotting

[0113] Proteins were extracted from the epidermis. Electrophoresis and transfer were performed according to standard techniques. The blotting membrane was incubated with the primary antibody 590 (1:1000) then with second antibody, rabbit anti-IgG, coupled to biotin. Detection was performed using ECL-plus reagents (Pharmacia). A band was detected at approximately 40 kDa, possibly representing a glycosylated form of the protein (FIG. 5).

[0114] Western blotting was also performed with proteins extracted from several normal human skin cell lines including neonatal keratinocytes (NHEK-Neo), adult keratinocytes (NHEK-Ad), neonatal pooled, neonatal dermal fibroblasts (NHDF-Neo) and adult dermal fibroblasts (NHDF-Ad), neonatal dermal microvascular endothelial cells (HMVEC-Neo) and adult), dermal microvascular endothelial cells (HMVEC-Ad) and epidermal melanocytes (NHEM-Neo). Four bands were observed at molecular weights 22-30 kDa, 17-22 kDa, 6-17 kDa and 4 kDa (FIG. 6).

[0115] SEEK 1 Gene Association With Psoriasis

[0116] SEEK 1 gene polymorphisms were genotyped in 147 families identified through a proband with psoriasis (a total of 499 individuals, of whom 233 were affected). Genotyping was performed using Pyrosequencing (Ahmadian A et a;., Anal Biochem 2000 280:103-110), an established genotyping technology well known to those skilled in the art.

[0117] Single Point Association

[0118] Single point associations between each polymorphism and psoriasis affected status were calculated using the TRANSMIT program (Clayton D, MRC Biostatistics Unit, Cambridge). P values <0.05 and corresponding chi squared values are provided in Table 1(iii). Highly significant associations were observed between SNPs 16, 17, 19, 20, 21, 22, 23, 24, 25, 40, 46, 56 and psoriasis. The single SNP showing the most significant association with psoriasis is SNP 24. There are no published data reporting the association of SEEK1 gene SNPs and psoriasis. This study has identified at least 12 SNPs that are powerfully predictive of affected status. TABLE 1 (i) SNP position and details AB031479 IUB Amino Acid SNP name nt position SNP Code Effect Position G16549A n/a CCCCAATCAGGTGTTCCGAACATCTCTGCG G/A R Non- [G/A]GACTGACCCTCCTCAGCCCAGGTGCTCC[C/T] coding G16548A n/a CCCAATCAGGTGTTCCGAACATCTCTGCG[G/A] G/A R Non- GACTGACCCTCCTCAGCCCAGGTGCTCC[C/T]A coding C16519T n/a [G/A][G/A]GACTGACCCTCCTCAGCCCAGGTGCTCC C/T Y Non- ATGGGACTGGCTACACTTCTTGACTCAGTT coding A16393G n/a GTAGACGATCAAGGGTGGAATCTACAGTCC A/G R Non- TG[G/A]GCCCTGACTTCTTGCCTTC[G/A]TCTCAAA coding G16390A n/a GACGATCAAGGGTGGAATCTACAGTCC[A/G]TG G/A R Non- GCCCTGACTTCTTGCCTTC[G/A]TCTCAAATAG coding G16370A n/a ACAGTCC[A/G]TG[G/A]GCCCTGACTTCTTGCCTTC G/A R Non- TCTCAAATAGACTCTGCAGCCAGCCATCTA coding C12553T n/a ATTAATAGGTACTAAAATCTCCAATTGCCT C/T Y Non- ATGCCTCCCCCTTCTCTTTCCCACTCACCT coding G12401T n/a GTGAGTTACCTCTCTCAGTGTTGGTTCCTC G/T K Non- TCTGTGAAATGGGGCTAATCATTTGCTTTA coding T3676C n/a CCCAGCCCCAGGAGGAGGAGCCTGTCTGGA T/C Y Non- GGACGCAGCCTGAACTGACCCACAAACAGA coding A3585G n/a TAGGTTTGTAAATACTTAACTGATGGTAAA A/G R Non- TGTCATGAACCCCTACCCCCGATGGATCTG coding A3444G n/a GCTTTGTCCTCAGGCCAACCTGCAACCCAA A/G R Non- GTGGGTTACACCTTGGCCCCCAGGCACACA coding C3394A 344 CCAGGCACACAGACCCCAGCTTTACAAGGA C/A M Amino Pro24Thr CCCAGCTCCTTAACACAGATCCCAGCTCC[G/A] acid sub- stitu- tion G3364A 374 [C/T]CCCAGCTCCTTAACACAGATCCCAGCTCC G/A R Amino Lys34Glu AGGAAACTC[G/A]T[:CCCCCCC]ACGTTAATCCT acid sub- stitu- tion G3354A 384 TTAACACAGATCCCAGCTCC[G/A]AGGAAACTC G/A R Amino Arg37His T[:CCCCCCC]ACGTTAATCCTGACCGACTT acid sub- stitu- tion C3352Ins/Del 386 AACACAGATCCCAGCTCC[G/A]AGGAAACTC[G/A]T (C)7[wild type] n/a (C)7- See FIGURE ACGTTAATCCTGACCGACTTTGCCACATGG wild 2 (i) type pep- tide C3352Ins/Del 386 AACACAGATCCCAGCTCC[G/A]AGGAAACTC[G/A]T (C)6[DEL] n/a (C)6- See FIGURE ACGTTAATCCTGACCGACTTTGCCACATGG trun- 2 (ii) cated pep- tide C3352Ins/Del 386 AACACAGATCCCAGCTCC[G/A]AGGAAACTC[G/A]T (C)8[INS] n/a (C)8- See FIGURE ACGTTAATCCTGACCGACTTTGCCACATGG trun- 2 (iii) cated pep- tide A3247T n/a TCTGCACCATGTCCCCCACCCAATGTGTCC A/T W Non- GAAAGCCATTTCTGGTGAGCCAGATGCACC coding G3210A n/a CATTTCTGGTGAGCCAGATGCACCTTCTGC G/A R Non- TCCCCTGAATTCCTGTCCCCAACCCCATGC coding C2205T 673 TCCACCTATCCGCCTCTAGGACCTTGGCTC C/T Y Amino Pro133Leu AACTCTATTGTACTCGTCTCCTCCCTCCCA acid sub- stitu- tion C2126Del[C] 748 CTCCTTGATCTAAGCCTCCCAGAGAGACCC C[INS/DEL] n/a 3′UTR TAGAA[C/T]GTTTCCCTCAAGGACCTTTCTGCC C2120T 757 GATCTAAGCCTCCCAGAGAGACCC[C]TAGAA C/T Y 3′UTR GTTTCCCTCAAGGACCTTTCTGCCTGGAAG

[0119] TABLE 1 (ii) Primers for SNP amplification Forward Reverse SNP name Primer primer Product size (bp) G16549A Seek1x2 F Seek1x2 R 402 G16548A Seek1x2 F Seek1x2 R 402 C16519T Seek1x2 F Seek1x2 R 402 A16393G Seek1x2 F Seek1x2 R 402 G16390A Seek1x2 F Seek1x2 R 402 G16370A Seek1x2 F Seek1x2 R 402 C12553T Seek1x3 F Seek1x3 R 294 G12401T Seek1x3 F Seek1x3 R 294 T3676C Seek1x4 F Seek1x4 R 241 A3585G Seek1x4 F Seek1x4 R 241 A3444G Seek1x5 F Seek1x5 R 340 C3394A Seek1x5 F Seek1x5 R 340 G3364A Seek1x5 F Seek1x5 R 340 G3354A Seek1x5 F Seek1x5 R 340 C3352Ins/Del[C] Seek1x5 F Seek1x5 R 340 SEEK1x5.A265T Seek1x5 F Seek1x5 R 340 SEEK1x5.G302A Seek1x5 F Seek1x5 R 340 SEEK1x6.C273T Seek1x6 F2 Seek1x6 R2 627 SEEK1x6.C352Del[C] Seek1x6 F2 Seek1x6 R2 627 SEEK1x6.C358T Seek1x6 F2 Seek1x6 R2 627

[0120] TABLE 1(iii) Nucleotide SNP Nucleotide Amino position in Location SNP position Acid consensus Chi in IUB in Poly- genomic DNA p squared SNP SEEK gene Sequence context of SNP Code AB031479 morphism sequence value value 1 Promoter GAAATAGCCACYTTCTCCCAAGGTTTCTTATACTCTR R N/A N/A 51814 TGGCACATCTGACCACCAGTAGCAGGCAGAATGATGT 2 Promoter CTCCTCTACTGTTACTTGGAAATAGCCACYTTCTCCC Y N/A N/A 51789 AAGGTTTCTTATACTCT 3 Promoter GATCAAGTCCTGGCCATTTGACAGCAGCATTTAAAG Y N/A N/A 51789 GCYCTCCTCTACTGTTACTTGGAAATAGCCACYTTC TCCCAAGGT 4 Promoter CATGTTTAGACCTTGGGCAGCCAGGGAAGCYTACTC Y N/A N/A 51570 CTGGGGCCTCCCGGAAGCCATGGAGAGAAC 5 Promoter CTCTTCACTCCTCCAGTGGTTAAGCCAGCAGGGGCA Y N/A N/A 51505 GGVGGGGAGGAGACAGCAGTAGAATCAGCCAACAG CTCAT 6 Promoter AGGCCTCTGGGCTCCATCCACTGCCAGTTCTGGAG W N/A N/A 51462 WGGAGCTCTTCACTCCTCCAGTGGTTAAGCCAGCA 7 Promoter ACATTGACCAGAAAGGGATTGAATCACCCTTGGTCC R N/A N/A 51265 AGCRTCTGGCCCCTGATCTGCAGCCAATGGCAGGA ATCGAGGTC 8 Promoter TGAATTTAGAACTGTTGAAACTCCAAGTCTGGAATCA R N/A N/A 51216 GCARAAATGTATTACATTGACCAGAAAGGGATTGAA TCACCCT 9 Promoter CTCAGAGCCTCTGCTTGGCTGCAAAGGAATTCACCC Y N/A N/A 51124 YTACTGTAGCACTTAACCCATTCCCTCCTATCAGGGT GG 10 Promoter GGATTGTGCTTGTCCCTGTAGGAGCCCCACCCCCC Y N/A N/A 51078 ACCCYAGGCCACCTCTCAGAGCCTCTGCCTTGGCTG CAAAGG 11 Promoter TGAGACAGGCAGGGAGAGGCTGAGGCGGASGAAGT Y N/A N/A 51017 TCCYGCATCCCAAGGAGGGCAGAGTGGATTGTGCT TGTCC 12 Promoter GACTTAAGTCCTGAGACAGGCAGGGAGAGGCTGAG S N/A N/A 51008 GCGGASGAAGTTCCYGCATCCCAAGGAGGGCAGAG TGGATT 13 Promoter GCTGAGAAGGCAGAGTGCCCCMGTGGGAAAGAGG Y N/A N/A 50920 AGTCGCYTCCACTGGAGAAGAGAGAGAAAGTGGAG TGTGTGGTG 14 Promoter AACATGGCTCTCAGGTGAGGGCTGAGAAGGCACAG M N/A N/A 50901 TGCCCCMGTGGGAAAGAGGAGTCGCYTCCACTGGA GAAGAGAGA 15 Promoter TAGATCAAGAGGCCCAGCCTGTGGCAGAACAGAGC R N/A N/A 50801 TGCCRGTGGTCTGTCCATCTTCACACTCCCTGCTCT GCTGGGGT 16 Intron 1 TCTCAGCCCCTTCCTGTGGCCATTTCCCTCAGTGCY Y N/A N/A 50049 0 28.13 CAGATGATTCCCTGGGTGAGGGAGACACTGGGGCA CCCTC 17 Intron 1 TACCCCAAGGAGAGTTACTCGACAGTCCAT[AAG]AA INS/ N/A N/A 49405-49407 0 18.31 GTCAACTGTTGTGTGTGTGCATGCCTTGGGCACAAA DEL 18 Intron 1 AGTTCCCAATCSAGTGGCAAAATCATCCTTCAGCCTT Y N/A N/A 49160 GYGGCAGCAAGTCCAGCTCTTCTGGTCACCCTTGC 19 Intron 1 GGCACCGCCTCCTTCAGCAGCAGCTCCAGTTCCCA S N/A N/A 49133 0.001 11.01 ATCSAGTGGCAAAATCATCCTTCAGCCTTGYGGCAG CAA 20 Intron 1 CCAGCRGTTCTAGCATTTCCAGCAGCKCCGGTTYAC Y N/A N/A 49045 0.003 9.7 CCTACCATCCCTGCGGCAGTGCTTGCCAGAG 21 Intron 1 CTCGAGTCCCCAGCRGTTCTAGCATTTCCAGCAGCK K N/A N/A 49038 0.05 4.55 CCGGTTYACCCTACCATCCCTGCGGCAGTGCTT 22 Intron 1 CCAAGGGACCCTGCTCTCCCTCCAGTTCTCGAGTCC R N/A N/A 49017 0.0008 11.16 CCAGCRGTTCTAGCATTTCCAGCAGCKCCGGTTYAC CCTA 23 Intron 1 ACCCCATCATCCCCAGCGAGTCGGCAGCTTCCTCG R N/A N/A 48920 0.0007 12.12 GCCATTGCRTTCCAGCCAGTGGGGACTGGTGGGGT CCAGC 24 Intron 1 CCAGGCATGACCTACAGTAAGGGTAAAATCTAYCCT Y N/A N/A 48773 0 36.5 GTGGGCTACTTCACCAAAGAGAACCCTGTGAAAGG 25 Intron 1 CAGGGACCTTGGCTAAGAGCATTGGCACCTTCTCAG Y N/A N/A 47938 0.0003 12.81 ACCYCTGTAAGGACCCCACGCGTATCACCTCCCCTA ACGACCCCT 26 Intron 1 KACTGAGATAAGGCAGAAAGGTGAGGRAGGAAGCC Y N/A N/A 47868 AAGCCTCYTTGGCCCTTACTAACCACTGCTTTCCTC CACAGGGACCTTG 27 Intron 1 AGAGGCCGATKACTGAGATAAGGCAGAAAGGTGAG R N/A N/A 47852 GRAGGAAGCCAAGCCTCYTTGGCCCTTACTAACCAC TG 28 Intron 1 CCCTGCGCTCTGCTTGGGAGAAACCCGAGAGGCCG K N/A N/A 47826 ATKACTGAGATAAGGCAGAAAGGTGAGGRAGGAAG CCA 29 Intron 1 TCAATGTATTCCTTTGAGGYCACTCACTTTGGCACST S N/A N/A 47661 AATTTTCTATTTTTCTGGTTGGTGTTTGCCCACCCTT 30 Intron 1 AGCCCCCTCTTATATTCAATGTATTCCTTTGAGGYCA Y N/A N/A 47645 CTCACTTTGGCACSTAATTTTCTATTTTTCTGGTTG 31 Intron 1 TCTTGAACTCTGGGGCRCATGCAATCCTCCCACCTC R N/A N/A 47567 RGCCTCCCAAAGTGCTGGGATTACCGGCGTGAGCC ACT 32 Intron 1 GGGTCTATGTTGCCCAGGCTGGTCTTGAACTCTGGG R N/A N/A 47547 GCRCATGCAATCCTCCCACCTCRGCCTCCCAAAGTG CTGG 33 Intron 1 AAAAAAATTTTAATTAAAAAACAAAATACAGAYRGGG R N/A N/A 47508 TCTATGTTGCCCAGGCTGGTCTTGAACTCTGGGGCR C 34 Intron 1 AAAAAAATTTTAATTAAAAAACAAAATACAGAYRGGG Y N/A N/A 47507 TCTATGTTGCCCAGGCTGGTCTTGAACTCTGGGGCR 35 Intron 1 CTGTCTCTTCAGGGTCCTTTCTTTTAGACCTAYTTGT Y N/A N/A 47438 TCCTGCCCGTTCTCCATTCCCTCTTCTTTT 36 Intron 1 GGAGGAACCAYGGGGTAAGTTGGGCCTGGGGTTTT S N/A N/A 46831 SAGCAAAGGAAAGGAAAGATAAGGAAAGATGTGGCT C 37 Intron 1 CAGAAGGAACGCAGGWGAAAGAGTCATGGAGGAAC Y N/A N/A 46806 CAYGGGGTAAGTTGGGCCTGGGGTTTTSAGCAA 38 Intron 1 CTGGAGGGGCTAGGGAAGGCAGAAGGAACGCAGG W N/A N/A 46784 WGAAAGAGTCATGGAGGAACCAYGGGGTAAGTTGG GCCTGG 39 Intron 1 AGGTGTTCCGAACATCTCTGCGRRGACTGACCCTCC R N/A N/A 39881 TCAGCCCAGGTGCTCCYATGGGACTGGCTACACTTC TTGACTCAGTTTTAATCTCTCCTTCTCTGCCTTCCTG TTGGGAATACCCCCTCACTTCTGTGGCTTCTTTCCT GTAGTAGACGATCAAGGGT 40 Intron 1 AGGTGTTCCGAACATCTCTGCGRRGAGTGACCCTCC R N/A N/A 39880 0 29.56 TCAGCCCAGGTGCTCCYATGGGACTGGCTACACTTC TTGACTCAGTTTTA 41 Intron 1 TCTCTGCGRRGACTGACCCTCCTCAGCCCAGGTGC Y N/A N/A 39851 TCCYATGGGACTGGCTACACTTCTTGACTCAGTTTT 42 Exon 2 TCAAGGGTGGAATCTACAGTCCRTGRGCCCTGACTT R N/A N/A 39725 CTTGCCTTCRTCTCAAATAGACTCTGCAGCCAGCCA TCTATGCAGCGC 43 Exon 2 GGGTGGAATCTACAGTCCRTGRGCCCTGCTTCTTG R N/A N/A 39722 CCTTCRTCTCAAATAGACTCTGCAGCCAGCCATCTA TGCAGCGC 44 Exon 2 ATCTACAGTCCRTGRGCCCTGACTTCTTGCCTTCRT R N/A N/A 39702 CTCAAATAGACTCTGCAGCCAGCCATCTATGCAGCG CCCCAGTGGC 45 Intron 2 CCTATTAATAGGTACTAAAATCTCCAATTGCCTYATG Y N/A N/A 35884 CCTCCCCCTTCTCTTTCCCACTCACCTACCTGCCAT GTCAGCC 46 Intron 3 GGCACTTGTGATATGACTTGCACAGGTGAGTTACCT K N/A N/A 35732 0.001 11.05 CTCTCAGTGTTGGTTCCTCKTCTGTGAAATGGGGCT AATCATTTGCTTTATTG 47 Intron 3 CAGCCCCACCCAGCCCCAGCCCCAGGAGGAGGAG Y N/A N/A 27006 CCTGTCTGGAYGGACGCAGCCTGAACTGACCCACA AACAGACCAAAAAA 48 Intron 4 ACCAAAAAAGTCACTCTCAAAGAGCTCTCGGTAGGT R N/A N/A 26915 TTGTAAATACTTAACTGATGGTAAARTGTCATGAACC CCTACCCCCGATGGATCTGAACCGTTCACTTGACCC ACTTT 49 Intron 4 CACTAGCTTTGTCCTCAGGCCAACCTGCAACCCAAR R N/A N/A 26770 GTGGGTTACACCTTGGCCCCCAGGCACACAGACCC CAGCTTTACA 50 Exon 5 TCAGGCCAACCTGCAACCCAARGTGGGTTACACCTT M 344 Pro24Thr 26724 GGCCCCCAGGCACACAGACCCCAGCTTTACAAGGA MCCCAGCTCCTTAACACAGATCCCAGCTCCRAGGAA ACTCGT:CCCCCCCACGTTAATCCT 51 Exon 5 TCACAGACCCCAGCTTTACAAGGAMCCCAGCTCCTT R 374 Lys34Glu 26694 AACACAGATCCCAGCTCCRAGGAAACTCRT:CCCCC CCACGTTAATCCTGACCGACTTTGCCACATGGAGCC AGCAAACCATT 52 Exon 5 TTAACACAGATCCCAGCTCCRAGGAAACTCRT[:CCC R 384 Arg37His 26684 CCCC]ACGTTAATCCTGACCGACTTT 53 Exon 5 AACACAGATCCCAGCTCCRAGGAAACTCR[C]CCCC INS/ 386-392 See FIG. 26675-26682 CCCACGTTAATCCTGACCGACTTTGCCACATGG DEL 2(i) 54 Intron 5 AGCCAAATGCACCTTCTGCACCATGTCCCCCACCCA W N/A N/A 26576 ATGTGTCCWGAAAGCCATTTCTGGTGAGCCAGATG CACCTTCTGCRTCCCCTGAATTCCTG 55 Intron 5 GCACCATGTCCCCCACCCAATGTGTCCWGAAAGCC R N/A N/A 26539 ATTTCTGGTGAGCCAGATGCACCTTCTGCRTCCCCT GAATTCCTGTCCCCAACCCCATGCGTCCAGTT 56 Exon 5 TCCTCCCTCAGGAATCCACCTATCCGCCTCTAGGAC Y 672 Pro133Leu 25534 0.03 5.5 CTTGGCTCYAACTCTATTGTACTCGTCTCCTCCCTCC CATTCTCCTTTTGGTC 57 Exon 6 CCTCCCATTCTCCTTTTGGTCTCAGCTCCTTGATCTA INS/ 748 3′UTR 25458 AGCCTCCCAGAGAGACCC[C]TAGAAYGTTTCCCTCA DEL AGGACCTTTCTGC 58 Exon 6 ATTCTCCTTTTGGTCTCAGCTCCTTGATCTAAGCCTC Y 757 3′UTR 25449 CCAGAGAGACCCCTAGAAYGTTTCCCTCAAGGACCT TTCTGCCTGGA

[0121]

1 109 1 861 DNA Homo sapiens CDS (275)..(730) 1 caggaaatcg agactcatga ctcccagaga ggatggcatc tagaagtaga cgatcaaggg 60 tggaatctac agtccatggg ccctgacttc ttgccttcgt ctcaaataga ctctgcagcc 120 agccatctat gcagcgcccc agtggctttg aaatgcaaca gaaaccatca cccccggacc 180 gtgggctcca tgccagtggg caaagcacag cctgggaaga attggtttgc agccaggcag 240 tcctccatcc agtcttgact ttggcacttg tgat atg act tgc aca gac caa aaa 295 Met Thr Cys Thr Asp Gln Lys 1 5 agt cac tct caa aga gct ctc ggc aca cag acc cca gct tta caa gga 343 Ser His Ser Gln Arg Ala Leu Gly Thr Gln Thr Pro Ala Leu Gln Gly 10 15 20 ccc cag ctc ctt aac aca gat ccc agc tcc aag gaa act cgt ccc ccc 391 Pro Gln Leu Leu Asn Thr Asp Pro Ser Ser Lys Glu Thr Arg Pro Pro 25 30 35 cac gtt aat cct gac cga ctt tgc cac atg gag cca gca aac cat ttc 439 His Val Asn Pro Asp Arg Leu Cys His Met Glu Pro Ala Asn His Phe 40 45 50 55 tgg cat gca ggg gac ctc caa gca atg ata tcc aag gaa ttc cat ctg 487 Trp His Ala Gly Asp Leu Gln Ala Met Ile Ser Lys Glu Phe His Leu 60 65 70 gca gcc acc cag gat gac tgc aga aaa gga agg aca cag gag gat atc 535 Ala Ala Thr Gln Asp Asp Cys Arg Lys Gly Arg Thr Gln Glu Asp Ile 75 80 85 ctg gtt ccc tct tcc cac cca gag ctg ttt gca tca gtc ctg cca atg 583 Leu Val Pro Ser Ser His Pro Glu Leu Phe Ala Ser Val Leu Pro Met 90 95 100 gct ccg gaa gaa gct gcc agg ctc cag caa cct cag ccc ctt cct cct 631 Ala Pro Glu Glu Ala Ala Arg Leu Gln Gln Pro Gln Pro Leu Pro Pro 105 110 115 ccc tca gga atc cac cta tcc gcc tct agg acc ttg gct cca act cta 679 Pro Ser Gly Ile His Leu Ser Ala Ser Arg Thr Leu Ala Pro Thr Leu 120 125 130 135 ttg tac tcg tct cct ccc tcc cat tct cct ttt ggt ctc agc tcc ttg 727 Leu Tyr Ser Ser Pro Pro Ser His Ser Pro Phe Gly Leu Ser Ser Leu 140 145 150 atc taagcctccc agagagaccc ctagaatgtt tccctcaagg acctttctgc 780 Ile ctggaagtct gttagccttt cagaagtaac atgtccaaaa taaaatttga ttcctcccag 840 gttgttccct gcctggtccg c 861 2 152 PRT Homo sapiens 2 Met Thr Cys Thr Asp Gln Lys Ser His Ser Gln Arg Ala Leu Gly Thr 1 5 10 15 Gln Thr Pro Ala Leu Gln Gly Pro Gln Leu Leu Asn Thr Asp Pro Ser 20 25 30 Ser Lys Glu Thr Arg Pro Pro His Val Asn Pro Asp Arg Leu Cys His 35 40 45 Met Glu Pro Ala Asn His Phe Trp His Ala Gly Asp Leu Gln Ala Met 50 55 60 Ile Ser Lys Glu Phe His Leu Ala Ala Thr Gln Asp Asp Cys Arg Lys 65 70 75 80 Gly Arg Thr Gln Glu Asp Ile Leu Val Pro Ser Ser His Pro Glu Leu 85 90 95 Phe Ala Ser Val Leu Pro Met Ala Pro Glu Glu Ala Ala Arg Leu Gln 100 105 110 Gln Pro Gln Pro Leu Pro Pro Pro Ser Gly Ile His Leu Ser Ala Ser 115 120 125 Arg Thr Leu Ala Pro Thr Leu Leu Tyr Ser Ser Pro Pro Ser His Ser 130 135 140 Pro Phe Gly Leu Ser Ser Leu Ile 145 150 3 861 DNA Homo sapiens 3 gcggaccagg cagggaacaa cctgggagga atcaaatttt attttggaca tgttacttct 60 gaaaggctaa cagacttcca ggcagaaagg tccttgaggg aaacattcta ggggtctctc 120 tgggaggctt agatcaagga gctgagacca aaaggagaat gggagggagg agacgagtac 180 aatagagttg gagccaaggt cctagaggcg gataggtgga ttcctgaggg aggaggaagg 240 ggctgaggtt gctggagcct ggcagcttct tccggagcca ttggcaggac tgatgcaaac 300 agctctgggt gggaagaggg aaccaggata tcctcctgtg tccttccttt tctgcagtca 360 tcctgggtgg ctgccagatg gaattccttg gatatcattg cttggaggtc ccctgcatgc 420 cagaaatggt ttgctggctc catgtggcaa agtcggtcag gattaacgtg ggggggacga 480 gtttccttgg agctgggatc tgtgttaagg agctggggtc cttgtaaagc tggggtctgt 540 gtgccgagag ctctttgaga gtgacttttt tggtctgtgc aagtcatatc acaagtgcca 600 aagtcaagac tggatggagg actgcctggc tgcaaaccaa ttcttcccag gctgtgcttt 660 gcccactggc atggagccca cggtccgggg gtgatggttt ctgttgcatt tcaaagccac 720 tggggcgctg catagatggc tggctgcaga gtctatttga gacgaaggca agaagtcagg 780 gcccatggac tgtagattcc acccttgatc gtctacttct agatgccatc ctctctggga 840 gtcatgagtc tcgatttcct g 861 4 55050 DNA Homo sapiens misc_feature (13351)..(13351) n is a or t or g or c 4 gatcctgcct tttcacacca ccacctggct ctgctgacac atctagtcac agacccctgt 60 gatgctgtta ctcagcaagt ccaaagcttg cccttgtcac ccccttccca cctgcacaga 120 tatgcaaagc agaaaccctc gtgcaggccc gaaagagaaa gcgaaccagt atcgagaacc 180 gagtgagagg caacctggag aatttgttcc tgcagtgccc gaaacccaca ctgcagcaga 240 tcagccacat cgcccagcag cttgggctcg agaaggatgt gagtgccatg tctctctgcg 300 ggctccatct ctttcccctg tcaccacctc gctttcccta gctctggctc ctccaactgc 360 tctagggctg ttggctttgg acagaatgtc caagcagtca ggcctgtctc agctcattct 420 ctaatgtcct cctctaactg ctctagggct gttggctttg gatagaatgt ccaagcagag 480 tcaggcccgt ctctcagctc attgtctaat gtcattctcc tttctgtcat tcactggcag 540 gtggtccgag tgtggttctg taaccggcgc cagaagggca agcgatcaag cagcgactat 600 gcacaacgag aggattttga ggctgctggg tctcctttct cagggggacc agtgtccttt 660 cctctggccc cagggcccca ttttggtacc ccaggctatg ggagccctca cttcactgca 720 ctgtactcct cggtcccttt ccctgagggg gaagcctttc cccctgtctc tgtcaccact 780 ctgggctctc ccatgcattc aaactgaggt gcctgccctt ctaggaatgg gggacagggg 840 gaggggagga gctagggaaa gaaaacctgg agtttgtgcc agggtttttg ggattaagtt 900 cttcattcac taaggaagga attgggaaca caaagggtgg gggcagggga gtttggggca 960 actggttgga gggaaggtga agttcaatga tgctcttgat tttaatccca catcatgtat 1020 cacttttttc ttaaataaag aagcttggga cacagtagat agacacactt atcttggttt 1080 gtccttcagt tactgaggtg gggatgggaa tatccaatgc tcatacccaa gtgaccctga 1140 aactaaggtg ccatttacac tccttaaggt cacacaacat cagagggaga gctgggattg 1200 cagccaagtt tatttgtaca gggccctgtg ataggctagt tcccaaaagc ctgtgatgca 1260 agaacttttg cccatagact cagtcaccat gtagctgtta cctgttcaga gctggctttt 1320 tgctttccca ccctactctg gaattcttaa atggctttat acttagaaat catcttattt 1380 ctgttgaacc tagatcaccc caaccagaaa cttctattaa tactttgtgc tttcttgata 1440 ccagggtcta tttggtttcc acttaaggtt tttgcatact ctgcccataa gtgactcatt 1500 agttactcaa gttttattcc tggctctgcc actagttcat taggggtctt tgccccagag 1560 tcatttcttc catgtaaaaa aacttgggct cattaaatct aggtaggaaa gggcggatgt 1620 ggcaggtttt aatagaacag gtcaagataa ggctttattt ctatagaaat gatgctttga 1680 caatagtttg gcttggtgta aggctcacaa aagaaaatca catgtaccat gtgtgggtta 1740 agcggtttga ttcacactga accaggccag cccagttgcc ctctgctgtg tccacccgtg 1800 gagtggagct gtgtcacagc catcacactg gtaaactgct gtagctggtt taccaggctt 1860 tctcttgccc tgacagtaca ggtgaagcct gtaaataaat cttctgctat ctttgtgaac 1920 ttaaccaaat cccagttacc ttatttaaat ggcaatagat ctgttttccc ttaaactaga 1980 aaccttaatt acctgtattc ctacctccag ctcaacccat atatttgcac ctttccagta 2040 agcaggtttg tatttccatc tctccccttc ccctaagatt ctgaattagt tctccagacc 2100 ttgccaagca cattctccct ggaaagcaag gaatatcaca gacccacaag aagagtaaat 2160 gcccaggagt gaatgaagcg gcttgtcctt gacttggaaa taaaagcaaa gctgtgaaaa 2220 gccaggtcgc tacgattttg ttggcagcag gactagccac agagtaggga agttttgggg 2280 ccaggcccta ggttttccca gaatgccttg ggtgatgcca ccaagaacct taagaactcc 2340 ctcttacatt ttccatcggt tgcaggcatg gcttttgcaa ctgataggtg ctctgactac 2400 agatattctg gtgtccatgg caacatggcc ttatggcttg taatagtggg aacttccagc 2460 tcactggcaa tttcctggag gtggcaatat ctttggcagg aaaccatctc ttctgtcttg 2520 gccaccaggg tctgctccag gccctgaatg agtccacatc aatccatcct tcttccctgt 2580 agtcattccc tcttggggag ggaagggaag gggggtattt atcagaccta tttgctcgct 2640 gctgtgtcta ctcatgggca ggtgtgtccg atggtgcctt ggcggtggac cttaggtctg 2700 aatgccagcc cggcaccctg ggcatcggaa gtcggcctcc ctggcagcct ggatgctgca 2760 gccaacacag gccacatgga accagacgtc acagccatca cactgaaccc aggccactgt 2820 ctcttcctgg ggcaggcagc aacaaggagc tgcacagggc tccccgcccc caactgcagg 2880 gggagccatg ggagcgctca ctgggtactt ccgaggacgg ccacgtcgat ttctgaagga 2940 agtgacacag acacaagggt caccggaaac ctgtaggaga aggcctagac cttgccaccc 3000 tatggctggt catagggtga ggccacagag gcacagtggt ctgagaacta aggtgagcct 3060 ggactctgtc cttaaaaaca catagggtgg gcctaaaaac ctctagtgtg agcctgaaag 3120 ttccccatcc ttaggcacta gaagaatggg gttaaagcca tatgaatgta aaaagcagga 3180 catgctcact ccggttgttt tataagcact gccccacccc acctctcatg ttctcctgca 3240 catatttttc ttttcctatc agagctaacc caggtcttgc cattccatct attccacctc 3300 ccgtgcccat ccccccaggc agggggacct cctcacagcc tagggataca aggaatcgaa 3360 aagcagtaaa acactgagag ggccaagtga ggactccact tacccagtgg gagtcagtgg 3420 ggctttgtct acacggagtt tcttcctggg ctccataagg accggtggcg ggttctcagg 3480 aggctcactc tcatcatcca gttccgccaa cactcggtga acagatttcc tccgattgcg 3540 ttggggtgga gcagaaggcg tggtccccac ttccccaggt ggggcgggaa caggcaggct 3600 ctttggtcca ccccaactag gggagaaggt gaggggctgg ggccggagct tgctgaggct 3660 gcctatggag tttaggatca gtgtggcctt gggggcaggg gagaaggtgc tgagaggccg 3720 ctgtggagcc ccctgggagg gcagcatagg ccggaaacca gccccaaccc gggcttctcc 3780 cctagaccgt gggatggtaa tggcagcaaa gtcctgaggc ttgactcgta cttgttggaa 3840 catgaagtag aactccgagg ggctggttcc tgggggccct tcagggccaa aggtcaggag 3900 gtctccatca ctcaattcca gcctgtgacc tcttgggagt cggacattat tgaccaaagt 3960 acctgttgat ggtggggcac caggcaaata tggaggacaa aaacagaaat gaccagagtc 4020 aggagtggag aaatctgtat gatttctggg gaagaatatg aaactgtagt cagattctcc 4080 ctcacatcct aagcccctca gcaggtgaca ggtaccaggc agtacactcc ttgtcagcta 4140 ggaacaagtc tgtgtttctt gtgctttcct acaaaggatc cctgccctca ctgtcctaag 4200 gtccaatttt ctttctttct tctttttttt tctttctttt tttttttttt gagatggagt 4260 tttgctctat tgcccaagct ggagtgcagt agtgcaatct cagctcactg cactccgcct 4320 cccaggttca agcaattctc ctgcctcagc ctcctgagta gctgggatta caggcatgtg 4380 ccaccacgtc tggctaattt ttttattttt agcagagaca gggttttacc atgttggcca 4440 ggctggtctc atactcctga cctcctgatc tgcccgcctc agcctcccaa agtgctggga 4500 ttacaggcgt gagccaccgt gccccgccct aaggtccagt tttcaaacat gctcactctc 4560 ccccagctcc agctcgttca gatcctagct ctaccactta ccagttatgt gaccctgggc 4620 aagcttttgg tttgtttgtt tgtttgagat ggagtctcgc tctgtcgccc aagctggagt 4680 gcagtggtgc gatctcagct cactgcaacc tccgcctccc gggttcaagt gattctgctg 4740 cttcagcctc ctgagtagct ggggttacag gtgcacgcca ccacacctgg ctaatttttg 4800 tatttttttt tcagcacaga tggggtttca acatgttggt caggatggtc tcgaactcct 4860 gacctcgtga tctgcccatc tcggcctccc aaagtgctgg gattacaggc gtgagccacc 4920 gcgcccaggc tttttttaaa aaattaatta attttttggt agagacaggt tttcaccatg 4980 ttgcccaggc tggtctcgaa ctcctgagct caagtgatcc accctgcctt agcctcccaa 5040 agtgctggat tataggtgta aaccaccatg cccagccctg gacaagttcc ttaacctctt 5100 tgtgcctcat tttcctttac tgtaaatgag gataatatta gtaccaccta cattcaatac 5160 acttaaaata gggcctggct cataacagta agcactagat gaatgttggc tactatagtc 5220 taataaccat aacactccta cagctgactt taccaagagg cagtggcaaa actcaaatgt 5280 gaatctaggc tttctgaggc taactgcctc aagtcacgga gaaagacatg cccagattat 5340 taaaaacatt attcaacata gaactgccct aacccaggaa cagagtgcct cagcaaagct 5400 ggaaactgaa tcaagtctaa acttggcctc tagtggcctc agaacatccc cactgtgata 5460 tctttctgat gatatcccag gtcctgtatg agctaaacgg gaccaaaaag ggtcagtctg 5520 gcagagagct tctgactgtg cacacttctg gtgggaagat gaaggtctgg accacgacta 5580 tccatccaat ctgatgggag tatgggaggc aggcagcaag gtcttgtgga gaccttactc 5640 attccagcct ggcgcttcaa ccacccaggg gcaaagctgc cctgcttaat gctcaccttg 5700 gctgctgtgg tcttccaggc tgaccctcca gtcatcaccc cggggctcgg catgcagttc 5760 ggcgtggatc ccagagatga ggccaggctc ctgctggggc cgcagggcca catcacacag 5820 gtcggccctg tggcccaagc gataggtgca gccagccccg gcgggggggt ggaaggtgta 5880 gagatcaccg cccctgccgc cccctatgcg cagcagttgg aagcagggca gcatgggcgg 5940 tgccccctct gcaccacctc ttccacttca ggcatccatt tcctttccca ctcctgggcc 6000 acgcaccgct ggcttaagtt cgcttcccag tctgatgcaa acttgagttg tgctgtatgc 6060 ggtttcaact ttgagcacac tacctgaata gcccaaattc agagtgcaag cagcctagga 6120 ctcaatgcaa aattggttaa gccgacttgc aaatgaaaaa ttccaggaag gtctccgtgc 6180 aatgcaaact caggaacacg tcctatggca cctcacttga gctggatcct cagatcacaa 6240 aatatttggg gagcctccaa tctgcaaggc aaggatactt taggcagctc ctgtgtgtgt 6300 cggggtggcg gggcgcgggg aaggaggtcc tgttatacac acgtctacgt gcaatacgaa 6360 cacatcacct ctgtgtaatt cggagaagag cagcttcttc ctgtcaaact tgcttatcct 6420 aatagagcac tcctgcttgc ctgcgccccg atcttctggc caatccctca ctcataactg 6480 gttgaacctc cgagggcaat gaatccactg tcatccccca aaagaaagtg caaacgagtc 6540 cttaagtact ccaacaaatt ggacctaaac ccctacttcc cggcttctct aggtgggata 6600 aaaacaggaa ggagctcccc gcgatcatgc aaggcatacc cggccttttc ccactccgcc 6660 ccagccccgt ttcggatcta cccgcgcggc gcagtctccg agtagcctcc tcttcaagca 6720 gtgccagcct gtgcggcgga tcccgggacc ccttgggctc ctcggcagga gccgacgttg 6780 ctgcatctgt ttgacagcca agaagccggg gccaggaggg gcgcggcctc tgcgcgcggg 6840 cagcgcctac ccctccttcg gaattcccgg gcccgaactt cgcgccgagc gagcccgccc 6900 ccgtgccgtt cccgattggc cagctcattt gttctcttcc gcgccattgg acagcgcccc 6960 aagatctccc gccttccact ttttctgatt agttcacaag ttttcccggg ttgccgcagt 7020 gaggaggagc cgcctgggcg cggagatgct ccgacgcatt tagtgggccc aaatttagac 7080 gccactgcgc ctgcgtacac ctttccccct cctcccaaca gagtgaaata ggactttagg 7140 gcgcccgcct tgcgcaggcg cggtcagggg cagaggcggg gccgggagcg aggcgtaggg 7200 ggtgtggcca aagcgcagta gggggctctc gcggttggta agggacgttc gggaagagtc 7260 ggttgggggc gggaaggggc tgagcggggg tcctgggagg gttcaagggg tccaaagggg 7320 aagggggtac aggggccagg tgaagggaca gatgaagaga tcagaacagc cttagagctc 7380 acgggctaac tgggtgaagc ctctggtatc taggggagca gtggtgggag agtagagccg 7440 gctagcgtac agggggcgag ggcaggctat tgagacatgg gggtgagggt caggaccaga 7500 ataattcctt cagaaaaagg tagattaggt tgtaaaaagg gacgggagta gggatcccag 7560 attcgggtaa aaaatagatt aatgagggtg ggagggtggg aggaagatct tcaagtgccc 7620 actatgtgtt aggactcgag aaagaggagg cgggcagcag gcatggaagc taaggggata 7680 gtagagagaa gctgagagaa cagccgcagg aaagatttga ataagatgtc aggacgctgg 7740 cttttctgcc tggggatcta ggcggcctgg attcccaagg tcttggggag ggtctagccc 7800 tgcatgtggc cacattcagc tggggccagg ccttgggcca gcactttaac agggaaggac 7860 ccaagagtca tggcctgctg gtgtctggat gggcttccct caggccttgc tgagccatgg 7920 agagaactct ggagatggcg ctcaagaccc ctgcactgtg tacctccttt ctcacctctg 7980 gccaggagca gcagggacca tagaaactta aggaggaggg taaaggcact gcccagaatc 8040 ctgaaagtat gagtaaagcc tcagagcccc tctcagtctg tctctttcag taaattctct 8100 ccacttgata gttagttgct tgggttccat gaggaaccag gttgagacaa gcaagtgaat 8160 ttggggtatc tcctactttg gggaagtagg aaaagagcta agcatcgggt ggacctggct 8220 ataattcaag gattatgact ttgttattgt aggggctttc tctttggaag gtggcctgaa 8280 aattgaattg gagtgtcttt gtttctcctc ttgtccaggg gaacatagat ggctgaagac 8340 agaatctaga gccttcaaat aatgtggaga tgtttccacc ttcaggtcag tgggaccaga 8400 caaggggagt ggtggttgtc tctgcctggg aaactgacca tctttgtttt tgtaattctt 8460 caggttccac tgggctgatt cccccctccc actttcaagc tcggcccctt tcaactctgc 8520 caagaatggc tcccacctgg ctctcagaca ttcccctggt ccaaccccca ggccatcaag 8580 atgtctcaga gaggcggcta gacacccaga gacctcaagt gaccatgtgg gaacgggatg 8640 tttccagtga caggcaggag ccagggcgga gaggcaggta gggatccatc cacgccgttt 8700 tctcaggctt gcttgctagt gacccttcct cactggaata aactccttac cctatttcgg 8760 ccctagtttc cagaacgtac tcatttttat gtaagcaatt agttcctaca gtgcataaat 8820 aataaagcag tgtgctaagt gctagggatg aggataacaa tagtgaataa gacaaatcct 8880 gacttaaatt tgtaattaat aaggaatcta aaatcctaag aggggagata gattgaccac 8940 cagctgatgg taaagcatgg tatgaggcaa gatgaagaag tgcaaactag aggtagagac 9000 ccagcactgt ggcagtgtgg gttcattctg atggaagaaa gcctttacag aggacatgct 9060 gtgggagctg gccatgctgt cgtattatct gtaaacattc ttattcttgt ttcatgtgtc 9120 ttttcccaga tgtactggga tatcattctg ccattttgct ccagaatagg attacattag 9180 aagaagcagc atgatgaaga aggaatacta gactgggtgt caggagacag agactctatt 9240 tctagtttgg ctaccagcta gaggacattt ggcaagtcac ttaatttccc tagcctacag 9300 atcgctgcta ctaaagaaag aaagaaagag cgagccagat atggcggctc acacctataa 9360 tcctagcact ttgggagctg aggcaggagg atcacttgag cccaggagtt catgaccagc 9420 gtgggcaaca aagcgagact ccgtctccac agaaaataat tagctggctg tggtggcatg 9480 catttgtagt cctagctact caggaggcta aggtggtagg accccttgag ctcagaagtt 9540 gaagactgca gtgacctatg atccagccac tgcattgagg cctgggtgac acagtgatac 9600 cctgtctcta aaaacgacaa caacaacaat ctcttatagt cctgggtctc agagagctgc 9660 ctcaggagcc atgttccaag ctggattaaa cttcacgtga cattggtaga cgatttctct 9720 aaaggctggc actgtgttat ttatgtactg ttctctcaga ctcctcataa agtataccta 9780 acactcaata aatgcctttt tttttttttt tttttttgag atggagtctc actctgtcgc 9840 ccaggctgga gtgcactggg gagatcttgg ctcactgcaa gctccgcctc ccaggttgac 9900 gccattctcc tgcctcagcc tcccaagcag ctgggactac aggcacctgc caccacgccc 9960 agctagtttt ttatattttt agtagagatg gggtttcacc gtgttaacca ggatgatctc 10020 gatctcctga cctcgtgatc cgcccgcctc ggcctcccaa agtgctggga ttacaggcat 10080 gagccaccgc gcccagccaa taaatgcctt ttaactagca cctggcctca ccatattgat 10140 actggaagct tacgacctct ctatgcccat tcctccctca aacttcttgc ccttaaatta 10200 gaattgagaa gtccctgtgt gttcttccaa cctttccact taaaatgtgt ggcctaaaaa 10260 taaaaaaata aataagacaa aaaaaccacc aaaaaacaaa aagaatgtgt ggcctaatga 10320 tatacacatt tgatgttgaa tcctcctgta tgtagctctc tctagggaga tcatgttcca 10380 cgatctcagc cagactttag gtcctgtgag tccaggcact ggactcaaca tgctcaatag 10440 ggctttgatg aatgatgatg atgtcaatgc agacatcccc ataccccagc ttcagcaccc 10500 ccttcacctc cccacacgga agcagagggg tcctcttttc cttctcctgg ctatgtttat 10560 gccctcaact atccttccag cactggagac aagtctcacc tgcactaacc tgtctttgaa 10620 ggtcctgggg gctggagggg tcacaggccc tgagccagca ggctgaggtg atcgttcggc 10680 agctgcaaga gctgcggcgg ctggaggagg aggtccggct cctgcgggag acctcgctgc 10740 agcagaagat gaggctagag gcccaggcca tggagctaga ggctctggca cgggcggaga 10800 aggccggccg agctgaggct gagggcctgc gtgctgcttt ggctggggct gaggttgtcc 10860 ggaagaactt ggaagagggg aggcagcggg agctggaaga ggttcagagg ctgcaccaag 10920 agcaggtgaa tgcaggggta gaaaggattc aaattcataa cggagagctg ggcagtagct 10980 tccaagcaaa gaacaggtat tgcagaaaag accctccatg agtagtgagt agtagagtga 11040 tgagaccttt ggtgaaaata aacacacatg ggctagaaag atggagaatt tgggtacttt 11100 tatcttaatt caggttgcac ttttccccca agactcaggt ggcctcccac cacttgaagc 11160 cctgcttccc tttctagtag gaaatagttt gcctccctac tttactccag gtaccaattc 11220 atcaggggta ttgtggaggg cagtgaggaa ggtgggtata agggggctat ggtggactgg 11280 gagagagaga ttattcagtc ctcaaactca gtacttactc tgccatctct aagactcagt 11340 gaccaaatta gcctgagtcc tgccttcgtg gatggaactg acagcctcgt gggggatata 11400 cacatccaca tttaatttaa tcataattaa ttgcaatcag gaggaatgcc ttgaaggaga 11460 aggactagaa agcactaact tagtctgggg ttcagggaag gccactgtaa ccagttgaca 11520 tgtcaaggaa agaaaccata gccctgggcg cagtggctca cgcctgtaat cccagcattt 11580 tgggaggccg aagcaggcag atcacaaggt caggagttca agaccagcct ggccaacatg 11640 gcgaaaccct gtctctacta aaaatacaaa aattcaccag gcatggtggt gcgcacctgt 11700 aatcccagct acttgggagg ccgaggcagg agaatcactt gaacctgggc agtggaggtt 11760 gcagtgagcc gagactgtgc cattgcactc cagcctgggc gacagagcaa gactctgtct 11820 caaaaaataa aaaaggaact gggagaaaac aagggagaat tcctttataa ccttgtagtg 11880 ggcaaggcct ttctacctgt gagtcaaaat ccaaaatcta gaagccataa aggaaaaaat 11940 tgatccattg actttataac atgaacatta ggaatagcca aaaagaaaaa aaaaagctat 12000 atttatagct cagatcacaa gaaaagggta atatccctaa tataaaatgt gtgcctagaa 12060 attggtaagg gaaagaccag caatccaatc agaaaatgga caaaggagat ttatgaaaga 12120 aacttagaaa caagaagcta ggccaggcac agtggctcat gcctgtaatc ccagcacttt 12180 gggaggccga ggtgggcgga tcacttgagg ccaggagtta aagaccaacc tggccaacat 12240 gaagaactct acaaaaaaat acaaaaatta gccgggtata gtcgtgggcg cctgtaatcc 12300 cccagcttct tgggaggctg aggcaggaga attgcttgaa cctgggagac agaggttaca 12360 gtgagctgac atcacactcc agcctgggca gcagagcgag actaaaaaaa caacaagcta 12420 ccgtttgtgc tgaataggag ttggccagtg aagaggcgtg tgaagtccag tggtagctgg 12480 aagacacttg gtgggacaac aggtgaaggc ggggacagga ggccagaagg ctggggcaca 12540 gagatgaggg gcactgagtg tgctgcagag cccaggaccc agggcacaag gctttggcca 12600 cttcagaact tgctactttc ccataagagc aatgagcagg ctgggcacag tggctcatac 12660 ctgtaatcct agcactttgg gaggccaagg tggaaggatc atttgagccc aggagtttga 12720 gaccagcctg ggcaacaaag cgagaccccc atctctattt tatggaagaa attagggctg 12780 ggcatggttg ctcacatgtg taatcctagc actttgggaa gctgaggcgg gtggatcact 12840 tgaggtcagg agttcgagac cagcctggcc aacatggtaa aacctcatct ctactaaaaa 12900 tacaaaaatt agctgggcgt ggtggctcat gcctgtaatc tcagctactc gggagggtga 12960 ggcaggagaa tcgcttgaac ctgggaggca gcgtttgcag tgagctgaga tcgtgccatt 13020 gcactccagt ctaggcaaca aagtgaaact ccatctccaa aaaaacaaac aaaaaaaaat 13080 tgttttttca agtaataagc aaccgttgaa aggttgtttt tttttttttt tagatggagt 13140 ctcgctctgt cgcccaggct ggagtgcagt ggcgcgatct tggctcactg caagctccgc 13200 ctcccgggtt catgtcattc ttccgagtag ctgggactac aggcgcccgc caccacgccc 13260 agctaatttt ttgtatttta atagagacgg ggtttcaccg tgttagccag gatgatctcg 13320 atctcctgac cttgtgatcc acccacctcg ngcctcccaa agtgctggaa ttacaggaat 13380 gagccactgc gcccggcctg ntgttgaaag gttttaagca gggaaataac atgattagat 13440 ttgtatttta tgtctaaaaa attttgtcat ttatgtcccc caaattaatt ttattgttgt 13500 atggagacag ggctagagga ggcagaccag gaagcagggt gggcactttg ccctcctttc 13560 cagtccatcc catgactctt ggtggctctg acacccctgc aaccctttga ggtgccatga 13620 gcaaaagaca caaaattcct cctttcctgg agctttcctt ccagtgtggt ccgacagata 13680 gtaacacata cacataagca agatatggtc agtgctaagt gctcaggagg acgtgaacag 13740 ctgatggggc agagtagggt ggggagggac ggtattagag ggcccagtga agccaccctg 13800 aggaggggct attgcctggg gtctgtggag caaggagggg ccgctgtctg gttctcagca 13860 gactccccgt ggccggagcg gggagcagtg ggagagcctc cagggtgagc tcaggaggta 13920 ggcagaggcc gggtcccctg gcctgcaggt gtggagagac acctgggttt tgttgtgaat 13980 gctgtgagaa gccactgagg gtttgtaaag actagttagg agatggtcgc tgttgcccag 14040 gcaaaagatg agggttggtg gcagtggaga cggagacaga gaggtgaaga tatgttttgg 14100 gggagatcgg acaagaactc ctgatgggtt gtggggcagc tgcggagagt gagttgccag 14160 ctctccattt gctgtgcaca gttggctgat tggttgggtc attctctaag gtcacagaaa 14220 gtgggagtga agggaacaag gaaggcctcc gtgtggggtc gagcctctgc tgagccccct 14280 cttctttccg cagctgtcct ctttgacaca ggctcacgag gaggctcttt ccagtttgac 14340 cagcaaggct gagggcttgg agaagtctct gagtagtctg gaaaccagaa gagcagggga 14400 agccaaggag ctggccgagg ctcagaggga ggccgagctg cttcggaagc agctgaggta 14460 ggtgggcgga cgccgacggg agcccagcaa ttagtgatgt ggtggatctg cagggcgccc 14520 cactgatggc tgtcccattc ccaccccaac cctagcaaga cccaggaaga cttggaggct 14580 caggtgaccc tggttgagaa tctaagaaaa tatgttgggg aacaagtccc ttctgaggtc 14640 cacagccaga catgggaact ggagcgacag aagcttctgg aaaccatgca ggtgagggtg 14700 caggaatgta tctgtgtgca gacttaggga tcaggttggg aggcaagcgt ggcccttgga 14760 ggagcgtgta gagcacagcc tccgggagag aaggtggtac ctaaggcggc atggaggccc 14820 tacagagggg ctgctttcct ctgcccgcag cacttgcagg aggaccggga cagcctgcat 14880 gccaccgcgg agctgctgca ggtgcgggtg cagagcctca cacacatcct cgccctgcag 14940 gaggaggagc tgaccaggaa ggtacagccc aacccccaga cccctcaccc tcagccgcat 15000 cctgcatcta ctgtcccctg cctccctccc tgtgggcagg aggggtcaat gtgccccaga 15060 acctgcttag atctccttcc tgtgaactcc tcttgctgta gctcatgttg cccaggcagg 15120 acagaggaga aacaaagatg ccacctcctt cctctcctcc cccaggagcc cacgcttttc 15180 tcccactcct tctccctcag gttcaacctt cagattccct ggagcctgag tttaccagga 15240 agtgccagtc cctgctgaac cgctggcggg agaaggtgtt tgccctcatg gtgcagctaa 15300 aggcccagga gctggaacac agtgactctg ttaagcagct gaagggacag gtcactgcac 15360 tctcttttct cccggtattc cctcccagca ccttgctcct tccatgaagg tggcatccat 15420 tcaaccagtg tttattgagt ggttgccaca tgctgggcac acagccctga acaaaactaa 15480 aatgtggagc ttgcattcta gaacagagac acagaacacg caagtaaaca gataatgttg 15540 ggtaattata tgtgcgatag aaagattgaa gccgggtgca gtggctcaca cctataatgc 15600 gatcactttg gtctcgaact cctgacctca ggtgattcac ctgcctcagc ctcccaaagt 15660 gatgggatta caggtgtgag ccaccgtgcc cagtcaagta atgccaacag tttgggagac 15720 cgaggcaggt ggatcactgg aggtcaggag ttcgagacca gcctgggcaa catgtgaaat 15780 cccgtctcta ctaaaaatac aaaaaattag ccgggcatag tggctcattc ctgtagtccc 15840 agctactctg gaggatgagg tgggaggatc acctgaggct gggaggtcga ggcgaggcca 15900 cagtgaactg tgatcccatc actgcactct agcctgggtg acaaagcgag atcttttctc 15960 aaaaaaaaag aaagtagtaa gaaaaattca aaagataatg tgacagagag actgtggggt 16020 gagtcagcct caggtaggat gctcagagac agcctctctg aggaggtgac agcatctgag 16080 gagagtggca tggtcagttg gtgggtcttg tggggtgggt caagggctat tcccatcttc 16140 gagtgggcac atggaatgtg gaacatggaa cactgggctc agattccatc ctcagaacct 16200 aagcttctgt ctccctgcgt ggcattcatt ctttttcttt tcttttcttt tttttttttt 16260 ttgagaagga gtcttgttct tgtcacccag gctggagtgc agtggcctga tctcagctca 16320 ctgcaacctc cgcctcccag gttcaagtga ttctcctgcc tcagcctccc gagtagctgg 16380 gattacaggc acatgccatc acgctcagct aatttttgta tttttagtag agacaggctt 16440 tcaccatgtt ggccaggctg atcttgaacc cctgacctca agtgatccat ctgcctcggc 16500 ctcccaaagt gctgggatta caggtgtgag ccaccgtgct gcgacccacc cccgtcgccc 16560 gccctccctc cccccagccc ctgcatggca ttcttacaga gatctctgca cctgccactt 16620 tgcttccagt gcccccctca tcttttagct ctagagggcc ctgcccagct ctctctcctc 16680 ccccaggtgg cctcactcca ggaaaaagtg acatcccaga gccaggagca ggccatcctg 16740 cagcgatccc tgcaggacaa agccgcagag gtggaggtgg agcgtatggg tgccaaggtt 16800 ggtgtcagcc tactagagac tcggggaggg caagggagcc cctgttccgg ggctgcagcc 16860 aggacttagg gagggaccct gtcctttgct gcatcctccc cagggcctgc agttggagct 16920 gagccgtgct caggaggcca ggcgttggtg gcagcagcag acagcctcag ccgaggagca 16980 gttgaggctt gtggtcaatg ctgtcagcag gtatcaggga tggaggggtg ggtggagtag 17040 tgtttctgct acctcaggtt cctgggcacc ttgttgctga ggatcctcag gcaagagggg 17100 ctggaaagtg gccactggag gctacagggc tgggcagatt tagctctatc aatgttcctg 17160 tgttcgtttc ttttcctggg gaagcccctt ctgcattcat acctgattgc ttgttatgaa 17220 tttcccgttg catgtttggc tggaggtgag gccttgcttc ctcctgcagt tcagtctagt 17280 aatggcttga gctaaataga gcacccggga ggatcttcac ttgcagtatt gttcaaggat 17340 ggagagtgta gacacttcat cttccttttt ttttctaaaa ttttacgggc aatccgtttc 17400 actggagaaa aatttagtct atttatttat ttattttgag acaaagtctc gctctgtcac 17460 ccaggctgga gtgcaatggc rcaatcttgg cycactgcaa cctcacctcc ctggttcaag 17520 tgattctcct gcctcagcct cccgagtagc tggattacag gcatcctcca ccagtgtcct 17580 ccactacgcc cggctaattt ttgcattttt agtagggacg gggtttcacc atgttggcca 17640 ggctggtctt gaactcctga cctcaggtga tccacccacc tcagcctccc aaagtgctgg 17700 aattataggt gtgagccact gcacctggcc tagtctattt atttaaagct gtatacttac 17760 ttgcttatta tatacttaac ttgcttacta ttccatctaa aatgtaagcc agttagtttc 17820 cttctaaatc aattgccagc ctttgtctct cctaccaact tcctagttgt ttcattacct 17880 acaattgttg tatgaccttc agaaaaacct ctaagaaaac agcaaagctt ctttgtgctg 17940 gtgatgactt cccctcagcc ttagacactg aggtacccaa ggcaggtagt tctttttttt 18000 tttttttgag acagagtctc gcactgtcac ccaggctgga gtgcaatggc acgatctcag 18060 ctcactgcaa cctctgcctc ccgggttcac acgattttcc tgcctcagcc tcctgagtag 18120 ctgggattac aggtgcacac caccacaccc ggctactttt ttgtattttt agtagagaca 18180 gggtttcact gtgttggcca ggctggtctc aaacttctga cctcgtgatc cgcccgcctc 18240 ggcctcccaa agtgctggga ttacaggctt gagccaccgt gcccggccgg caggtagttc 18300 ttagcacagt ctctggcttg taaatgttga ttgttatcgt gaggctcttc ttgatgggtt 18360 aatttagata aagataattt ttggtttagc gaaattaaga tgcaggatga gtccttgccc 18420 accacttcct tctcttgggt ttgtacctta gggactaatt tagctttaaa aattatgaaa 18480 aatttcaaac ttgcacggaa ttaaactagt tataatgacc taccacccag gttcaatcca 18540 tcgcgggtgc cccctacctc caggagaaca gaaagatgca ctgtggacag ggtttgactt 18600 tggtaccagg tgatcacaga aatggtctgt gtaatgatgc atttgccgaa agtcctccag 18660 gcttaaagca gtccaaactc tgattgttgc tgggtttatt agattgtctc taggtaattg 18720 gagactttaa taagagctgt ggtaggtgtt ggaagcatct actggaacaa tttccagatc 18780 aaagtgaact ttgctgtgct gctggggatg cagctgcaag ccttcatcat catgtgtttt 18840 tctgtgggtg cagacctgga ctctcctgag gaaaccccag cccggcccca gacactcctt 18900 ggcctcctcc tggggcctgt ttaagctgct cagttttcat gagccaggtt ggtcctactt 18960 ctggcacagc cagctggtaa agcatgtgga cctgcccgtc actggtgcta gatcgacact 19020 cctgggcttg gagaggataa ctttgttttc ttttgttttt ttgagatgga gtctcgctct 19080 gtcacccagt cttgagggca ggggtgcgat cttggctcac tggaacctcc acctcctagg 19140 ttcaagtgat tctcgtgcct cagcctctgg agtagctggg attacaggca tgagccacca 19200 tgcccggcta atttttgtgt ttttaagtag agagagtttc accrtgttgg ccaggctagt 19260 ctccaattcc tgacttcagg tgatccgccc gcctcggcct cccaaagtgc tgggattaca 19320 ggcaggagcc actgcccctg accagagagg ataactttac tctttgatac acgatagtga 19380 gcaaaacaca gttgtgagaa ataagcttaa caggttgctt aaaaagatag tcatttaatg 19440 cattcttggg gcaagggtcc tttagataat tgacggaagc tgtgcgttct gtacttgtat 19500 aatgggacag gattagaggg agttgtctat acaaggcaca gcaagtcctt tgggaatgag 19560 gggaggcatg gaggatcagt gacttgtgcc ctctccagct ctcagatctg gctcgagacc 19620 accatggcta aggtggaagg ggctgccgcc cagcttccca gcctcaacaa ccgactcagc 19680 tatgctgtcc gcaaggtcca caccattcgg ggtgcgtagg acaactgcga gccacgtcct 19740 gcccccaccc caccagctcg gactttcttc ttcctgaccc agctctctct gatcccacat 19800 ccattcacct tcctcctttc accagtcctt gcatctcttt ttcccttact ccctgtcccc 19860 actttctccc atgcaaactt catctctttt tctccctgct ttttccctcc caggcctgat 19920 tgctcgaaag cttgcccttg ctcagctgcg ccaggagagg tgaagtttgg gcactttgag 19980 gtggatgggg ctttagggca ttggctgctg ggacccccaa aaccatgagg actgaggtgg 20040 gatgggggct ttgggatcag gcagctgggt gatttctcct gactctttct cttccccgtc 20100 tcagctgtcc cctaccacca ccggtcacag atgtgagcct tgagttgcag cagctgcggg 20160 aagaacggaa ccgcctggat gcagaactgc agctgagtgc ccgcctcatc cagcaggagg 20220 tgggccgggc tcgggagcaa ggtacacctg gttgccagag ggtggagagg atgaggaaaa 20280 acccagtgtc tagggtgctg ggagaggcct gacccagcac cccctccttt taggggaggc 20340 agagcggcag cagctgagca aggtggccca gcagctggag caggagctgc agcagaccca 20400 ggagtccctg gctagcttgg ggctgcagct ggaggtagca cgccagtgcc agcaggagag 20460 cacagaggag gctgccagtc tgcggcagga gctgacccag cagcaggaac tctacgggca 20520 aggtgtcgag agggaaatgg gtgcttccct tggagggtgg ggtgggaact gcgaatcaaa 20580 ggtcctgctg atatgccccg tctgcacttt caccccagcc ctgcaagaaa aggtggctga 20640 agtggaaact cggctgcggg agcaactctc agacacagag aggaggctga acgaggctcg 20700 gagggagcat gccaaggccg gtgagccttg ccagggtgga tagggccttc caggaagaag 20760 gaagtgttaa gacataaggt tattattttc ccctcaaagt gtgttcaaag cttcattaca 20820 ggaagtaatg aaggtatcca ggagtagcac agatgaatta tcacatcgtg aacacaccca 20880 tgtagccagc accagattaa gaaacagcat atggccggtc gcggtggctt atgcctgtaa 20940 tccaagcact ttgggaggcc gaggtgggtg tatcacctga ggtcaggagt ttgaggccag 21000 cctggacaac atggcgaaac cctgtctcga ctaaaaatac aaaaattagc taggcctggt 21060 ggtgggcacc tgtacmccaa gcttacttgt gaggctgatg tgggaagayt acatgaaccc 21120 gggaggtcga ggctgcagtg agccaagatt gtgccactgc actcaagcct gggtgataga 21180 gaaagaccat gtgtcaaaaa aagaattgtg taatgaatgt atcttctcta actaaatata 21240 gcagttaaca tttgccacat ttggtctctt atctatatac acacatattt gtacatcttt 21300 tgaatcactt taagttgtaa tcatttaatg ttttgttgtt gttgttgttt gagacagagt 21360 cttcctctgt caccagctgg agtgcagtgg catgatcttg gctcactgtg acctctgcct 21420 cccgggttca agccattctc ctggctcagc ctcccaagta gctgggatta caggcgccca 21480 ccaccatgcc cagctagttt ttgtatttkc agtagagacg ggattacacc atgttggcca 21540 ggatggtctc gagctcctga cctcgtgatc cgcccgcytt ggcatcccaa agtgctggga 21600 ttataggcgt gagccaccac gcctaagtaa gttgtaaaca taagttgttc agccgcatct 21660 cccaaagcca gtaaattctc ctatatagct gcaatcatca cactttaaga cagtgaacac 21720 taattgcaca aaatctaacc cagttcatgt tcagatttcc cctgaggaac tccaggatgg 21780 ttcagggatg aggaagatac ttaggttcag attcccaggc tcctagagca tcagccaacc 21840 cctccaactg tacagaagag acagatccac agagcagaac agcctcccca agccacagag 21900 ttggtgaccc agcgtttgtt cctgtcttca tggtgcctgg ctgcctctgg cctgactcac 21960 acctgcctcc tctgtgcctt ggcctctctg tagtggtctc cttgcgccag attcagcgca 22020 gagccgccca ggaaaaggag cggagccagg aactcaggcg tctgcaggag gaggcccgga 22080 aggaggaggg gcagcgactg gcccggcgct tgcaggagct agagagggat aagaacctca 22140 tgctggtagg agacaggagg gcagacaggc agacactagg gcccatcctg ggctggttcc 22200 tgggctagag gtgtggaaag aggatggtga gggaggctct atccgggcta ggtttaaccc 22260 tctccttccc aggccacctt gcagcaggaa ggtctcctct cccgttacaa gcagcagcga 22320 ctgttgacag ttcttccttc cctactggat aagaagaaat ctgtggtgtc cagccccagg 22380 cctccagagt gttcggcatc tgcacctgta gcagcagcag tgcccaccag ggagtccata 22440 aaaggtcttg ggccaagcac aaagggacaa gggacaaatg cgcgcacttc aggaatctcc 22500 tcttcagact ctcgcatgat gagtgttgtt ctctgcggtc cttcgaggcc cttagcctct 22560 tttagcgatg cccagcttgg accaaagagc ctcctctctc ccattcctca tttcctgtgc 22620 cagccctgtt tcctctgtaa ccacgagcac cttcccttgt ctggtgctca tctgctgtct 22680 tccttcccag ggtccctctc tgtcctgctc gatgacctgc aggacctgag tgaagccatt 22740 tccaaagagg aagctgtttg tcaaggagac aaccttgaca gatgctccag ctgcaatccc 22800 cagatgagca gctaagcagc tgacagttgg agggaaagcc agcctggggg ctgggaggat 22860 cctggagaag tgggtgggga cagaccagcc cttccccatc ctggggttgc cctgggggat 22920 accagctgag tctgaattct gctctaaata aagacgacta cagaaggagc cattgtttgg 22980 aaatgttatt tctgggaatc tgtgggaatt ctccctgcag ccatctctct tggagattgt 23040 gagaagtggt cccacccata gctctctcta tgggtggcac cttttcaaat cctttttcta 23100 ggcagctttg gaaatgtttg ggaagctcct gcttttcttt attctcaact tttgttttaa 23160 aactcttttc ttctggaaga tgataaatgc taggagtggg gaagatgaaa cattcactcg 23220 ttcaacagga atgtcaatgc caggcactgt attagcctgg gggccacttg gaccctgaga 23280 gcctacatcc tagtgggaga gmgagcgagc agatgggacc taaacagatg aacgagatcg 23340 tttcaaataa gtgcttgaag agaaagtaca gtacgatgat atgatagaaa ctgactctgg 23400 ggagaggcct ctgagcaggg gaccttgtgc agaggcctgc ctgaagtggg ctcctgtgga 23460 ggccaggagg tctctgtaga gaatgctcaa ggatgctgct gcgaggccga gatgtggcca 23520 agtttggatt ttactgtaag tcaatagaaa gccattggag tacttcttta aaaacgggga 23580 agaaagccag gtgtggtggc tcacgcctgt aatcctgaca ctttgggagg ctgaggcacg 23640 cggatcacct gagctcagga gtttgagacc agcctgggca acatggtgaa acttcgtctc 23700 tactaaaata caaaaaatta gccaggtgtg gtggtgtgag cctgtagtcc cagctactca 23760 ggaggctgac acaggagaat tgcttggaca cgggaggtgg aggttgcagt gagccgtgat 23820 catgccactg cactccagcc tgggtgacag agcgagactc agcctcaaaa aaagaagaag 23880 aattaaaaaa aaaaaaaaaa aacagggaag ggggcccggt gttgtggctc ccgcctgtaa 23940 tcccagcact tttttttttt tgagacagtt ttgctcttgt tgccaggctg gagtgcagtg 24000 gcacaatctc agctcactgc aacctccacc tcccaggttc aagcaattct cctgcctcag 24060 cctccggagt agctgaatcc cagcactttt ttgggaggcc gaggtgagcg gatcgcttga 24120 gcccaggagt ttgagaccag ccaacatagg gagactccat ctatataaaa aataaaaata 24180 aaaaacattg gagggcatgg tggcgtgcac ccatggtcct agttactcca gaggctgagt 24240 tgggaggatt gtttgagccc aagaggttga ggctgcaatg agctgtgatc gcaccactgc 24300 accccagcct aagagagcga gaacctgtcc tcactcccca aacaaaggga agacgcaaca 24360 tgatttggtg tacatttttt ttcctgaggg acttactgga tggtcccttc cagagtgagg 24420 tacacatatc cacgcactgt ggtcagcgat tgctcccggc acccagcgca gcagatgggt 24480 gggtctaacc aggtcacttc cccaggaggg cataattgag cagtttccgc atcaggtcca 24540 cgtgggccag cagcatgcgg caggcagcgg ctgtggygct tcaggcccag ggcgtcccca 24600 tcagcgtact tggactgcag cagccccagg taggcctccc acgtgttgcc cacagtcttg 24660 ctgcaggtga agcagcgcac acaactcctg agctcaagtg gtcctcctgt gtcagcttcc 24720 caaagtgctg gcatgagcca ctacgcctgg cctgatatat atatatatgt tttttttttt 24780 tttgagacgg aatttcactc gttgcccagg ctggagtgca atggtgtgat ctcagctcac 24840 cacaacctct gcctcctggg ttcaagcgat tctcctgcct cagcctcttg agtagctggg 24900 attaccaaga ggcatgtgcc accatgtccg gctaatattt ttgtattttt agtagagacg 24960 gggtttctcc atgttggtca ggctggtctt gaactcccga cctcaggtga tctgcccgct 25020 tcggcctccc aaagtgctgg gattacaggc gcgagccacc acaccaggcc cgcgtgatgt 25080 atattttaag acctcttttg ctggtggagg acaggctttg tgtgaggggg agggataaac 25140 agtgggagca agggggccaa ttagaagggt gttggggagg ctcaggggag atggtggctc 25200 aggatgatgg gctgggtttg gacagggtgt ggaggggctt gcaggtggat ggtggaggag 25260 tgtaacgaag gtttctgcgt gagccctgga gggaacagat gagatcacgc cattgcataa 25320 taaggtgttc cttactgtgg ggtagcggac caggcaggga acaacctggg aggaatcaaa 25380 ttttattttg gacatgttac ttctgaaagg ctaacagact tccaggcaga aaggtccttg 25440 agggaaacat tctaggggtc tctctgggag gcttagatca aggagctgag accaaaagga 25500 gaatgggagg gaggagacga gtacaataga gttggagcca aggtcctaga ggcggatagg 25560 tggattcctg agggaggagg aaggggctga ggttgctgga gcctggcagc ttcttccgga 25620 gccattggca ggactgatgc aaacagctct gggtgggaag agggaaccag gatatcctcc 25680 tgtgtccttc cttttctgca gtcatcctgg gtggctgcca gatggaattc cttggatatc 25740 attgcttgga ggtcccctgc atgcctgaag aaggacatgg tggagagcag gatgcctgga 25800 tcccatgggg gaagggaagt gcccaggaaa gcacgaagcc ccagggggag ctttcagtgc 25860 ggggrtgagt ggggaggctg gggtagtagc tgacactgtc ccagctgcat cccaggtttg 25920 aaaggcacct cctcccccag cgcaggcatc ctgcctccca accctgtaat tacggtgctt 25980 cccaacgccc atcgygtggt ttgctcccat tctttggctt ccaatagttg caagggatga 26040 aggtggacat ctctgtgatt acggagatgc caagtgggta ttgactgctc cagggtgtgg 26100 atggagggtg tgaaaaccag ggtggggtga cgcaggctct gggtcatgat agggagagca 26160 ggcagctggg tcctgggctg gaggactaaa ataagggacg ccaccttcag gggtgacaca 26220 tcagcccagg ccttcccaac gggtttgacc agttctgttc tgatggtatt cctgtgccac 26280 tgggctggyc cctcctccac tcctccccta taaagcctct tggggttccc aggcacccag 26340 actcagccca ccccagcttt gggggccagt acatagccat gatcctcaac tggaagctcc 26400 tggggatcct ggtcctttgc ctgcacacca gaggtgaggt gggaacagag gcagggactg 26460 cagtttgggg tgatgaggga tactcaagat ggcggaggtg aactggacgc atggggttgg 26520 ggacaggaat tcaggggayg cagaaggtgc atctggctca ccagaaatgg ctttctggac 26580 acattgggtg ggggacatgg tgcagaaggt gcatttggct ctcaccagaa atggtttgct 26640 ggctccatgt ggcaaagtcg gtcaggatta acgtgggggg ggacgagttt ccttggagct 26700 gggatctgtg ttaaggagct ggggtccttg taaagctggg gtctgtgtgc ctgggggcca 26760 aggtgtaacc caccttgggt tgcaggttgg cctgaggaca aagctagtgg ggtaccccaa 26820 ccaggggtgg atggagctta tttggagaag tctggtcagt ttaaagtggg tcaagtgaac 26880 ggttcagatc catcgggggt aggggttcat gacattttac catcagttaa gtatttacaa 26940 acctaccgag agctctttga gagtgacttt tttggtctgt ttgtgggtca gttcaggctg 27000 cgtccrtcca gacaggctcc tcctcctggg gctggggctg ggtggggctg gggagagaag 27060 ccctcaccac ctcttacctt tctccttcct cctttacagg catctcaggc agcgagggcc 27120 acccctctca cccacccgca gaggaccgag aggaggcagg ctccccaaca ttgcctcagg 27180 gccccccagt ccccggtgac ccttggccag gggcaccccc tctctttgaa gatcctccgc 27240 ctacccgccc cagtcgtccc tggagagacc tgcctgaaac tggagtctgg ccccctgaac 27300 cgcctagaac ggatcctcct caacctcccc ggcctgacga cccttggccg gcaggacccc 27360 agcccccaga aaacccctgg cctcctgccc ctgaggtgga caaccgacct caggaggagc 27420 cagacctaga cccaccccgg gaagagtaca gataatggag tcccctcagc cgttctgttc 27480 ccaggcatct ccaggcaccc acgccctctc caccctctga ttccccgtga attcttccca 27540 atttagccta tctccttaaa cctcttcctc attccctcgg ttttattctg aacccgtaag 27600 gtggtgttct caatatttcc tgtcccctcc tgagatccat acttagtcct cacatcgccc 27660 gttttttcct ctgacagcct aagcctactc tcctacctcg cctccaggcc tcggccccac 27720 ctacctccca cccggtcttc ctgcccgcgc gatcgctggg gcagggctay ggtactgtgt 27780 tcccttctgc cacctggtgg ccggcggcag gaactatcag tagacagctg ctgcttccat 27840 gaaacggaaa aataaaaatc atgttttctt aaytctgaat ctaggctgct gctttaacta 27900 acacttaggg tctttttcat ttatttttat ttatttgttt ttttcttttt ttgagacgaa 27960 gtctcgctct gtcgcccagg ctggagtaca gtggcacgat ctcggctcac tgcaagctcc 28020 gcctcccggg ttcacgctat tctgcctcag cctcccgagt agctgggact acaggcgccc 28080 gccaccacgc caggctaatt ttttgtattt tttagtagag gcggggtttc accgtgttag 28140 ccagagtggt ctcgatctcc tgacctcgtg atctgcccgc ctgggcctcc caaagtgctg 28200 ggattacacg cgtgagccac agcgcccggc ttctttcttc tttttctttc tttttttttt 28260 agatggagtc tcactctatg cccaggctgg agtgcaatgg cacgatctcg gctcactgca 28320 acctccggct cccgggttca agccattctc ctgcctcagc cttctgagta gctgggatta 28380 caggtgcgca ccaccatgcc cggctaattt ttgtatttta gtagagatgg ggtttcacca 28440 tgttggccag gctggtctcg aactcctgac atcgtgatct ccccgcctcg gcctcccaaa 28500 gggctgggat tacaggcgtg agccaccgtg cccggccaac acttatgttt ttgactatta 28560 ggatgccctc ttcacagtcc taaacttacg gagacctgga agtaacttga gttcctatct 28620 tgcccatgtc cagcatgtaa ggctctgggg cttagcagga ggagggttgg aaatgtcact 28680 atgcaagtca caataacatt caggcccaca tttctccctt tctgagaaca ctatattaaa 28740 gaatgggaag gcaagtttca tctctgttta atggcctatg gcttggatac ccctagtggt 28800 atatgcaaac cttcccaggg gtgtgtcggc aggaccagtt ttaagggaat cagtttccag 28860 attaatatgt gccccccgct agaatgaatc tcctgcttgt cctgggcctg accagagtgc 28920 ccttcccaga gccgccaaag gtcaatagga aacaaatcaa cctttcccat ctcattaaga 28980 gattcatttt ctttcttttc tttttttttt ttttttgaga cgtattatct ctctgtcgcc 29040 caggctggag tgcagtggca cgacagatat cagctcactg caagcttcgc ctcctggatt 29100 caagtgattc tcctgcctca acctcccgag tagctgggat tacaggtgtg tgccaccaca 29160 cccagataag ttttctattt ttagtagaga tgggattttg ccatgttggc caggctggtc 29220 tcgaattcct gatctcatgg gatctgcttg ccttggcctc ccaaagtgct aggattacag 29280 gtgtaaacca ccacgcctgg ccaagagatg cattttcaat aagttacttt tcatgtcttt 29340 ttgtgtgttt gtttgagaca gggtctccat ctgtcatcca ggctggagtg cagtggcacg 29400 atcatggctc gtatagcttc aacctcctgg gctcaagcaa tcctcctatc tcagcctctg 29460 gcgtagctga gactacaggt gcaccacccc tgactaattt tttgtatttg tttagtttag 29520 ttttgtttcg tttttagaga tggggtttta ccgtgttgcc cgggcttgtc tcaaactcca 29580 gagctcaagt gatcggccca tcttggcctc ccaaagtgct gagattacag gcacgagcca 29640 ccgcgcctga ccaacttttt atgtttaatc cttgtgaata ttcctagttt tggttaactg 29700 caataattgc aatacaaata gaataactgt ttctaacact tgttcaaggg cttgttcacg 29760 tattttttaa aaggatgcta acagatatga aagttctatg gcattatatt caatttgcta 29820 cacttagagt gacgtgcagt ctccgacaga ctgagcacaa caaattgttt ttaattttaa 29880 aaactgacat ggccaggcat ggtggctcac gactgtaatc ccagcatttg ggaggctgag 29940 gtaggcagat cacttgaggt cagcaattca agaccagcct ggacaatggt gaaactctgt 30000 ctctactaaa aatacaaaaa acttagctgg gcatcccagc tactcgggaa gctagggcat 30060 gaggattgct tgaacctggg aggcagaggt tgcagtgagc cgagatcgca ccactgcact 30120 ccagcctggg agacagagtg agactccatc tcaaaaataa taaataaata aataaataca 30180 taaatagtga tgtgattttt aacatgtatt tgcaattccc tgaaaagcct accctttgga 30240 atgctattaa attattacaa atgttaaatg ttgacttaaa aatgtgcaag gggctgggcg 30300 aggtggctca tgcctgtaat accagcactt cgggaggccg aatcgggtgg attgcttgtg 30360 gccaggagtt tgagaccagc ctaggcaaca tggcaaaact gtctctacag aaaatttaag 30420 aaattagcca gatgtggtgg cccgcacctg tagtcccagc tactcaggag gctgaggtgg 30480 gaagattgct tgaccctggg aggttgaggc tgtagtgagc caagatggca ccactgcact 30540 ccagtctggg caacagagtg agaccgtgtc tcaaaacaat acaaatgtgc aagggacata 30600 gtttttcaaa atcctttaaa gaggcaatca ggttagaagg acaggagctc agagatccca 30660 atggtctact gtcaatcaag tatccgacca gggttaggga tgaagagggg ttaaaagaaa 30720 ctgaggttgc ataaccttaa atttcaccac ttagaaccca gtttgcttat gtggtaactc 30780 tcattaaaaa ctacatatga gaggccgggc gcggtggctc acgcctgtaa tcccagcact 30840 tcgggaggcc aaggcgggcg aatcacgagg tcaggagatc gagaccatcc tggctaacat 30900 ggtgaaaccc agtctctact aaaaaataca aaaaaaatta gctgggcatg gtggccggcg 30960 cctgtagtcc cagctactcg ggaggctgag acaggagaat ggcgtgaacc tgggaggtgg 31020 agcttgcagt gagccgagat tgtgccactg cactccagcc tgggcgacag agcgagactc 31080 catctcaaaa aaacaaacaa ataaaaaacc aaaaaactac acatgagatc aggcgtggtg 31140 gctcacacct gtaatcctag cactttggga ggctgaggcg ggtggattac ctgaggtcag 31200 gagttcgaga ccagcctcac caacatggtg aatccctgtc tctactaaaa atacaaaaaa 31260 attagctggg catggtggcg ggcgcctgta atcccagctt ctcaggaggc tgaggcagga 31320 gaatccattg aacctgggag gcagaggttg cggtgagcca agatcgtgcc actgcactgc 31380 agcctgggcg acagagcaag accccgtctc agaaaacaaa aaacaaaaaa aaactacatg 31440 tggtccgaat gaaacaaaac taagcttagg gtttaggaat aatctgagaa cacataagaa 31500 ttgtaggttg agcctagtag aattaaatag gccccaagct ggactggatt cacccattca 31560 ttcattcatt atcttacttc ctcaatgtgt ccacgaatgc cgggtgccat gggagaatat 31620 aagaatataa ataataaaaa tatgtagttt ctactcagaa cttaaaattg agagagacag 31680 aatttacagg caagtttaaa taacatcaaa gacagtaaaa atgcatattt cctaataatg 31740 acatgagcga gcgccaatgt aatagccttg gcagtaaacg ccgtgagttc agaagagtca 31800 cggtgagctg gactagtcag gggaggcttc tgggaggagg gcccggagcg ggacctgaga 31860 gaagaacagg cagtgtgtct ggaggatgga ccaggaaggg cagacccgga gcctcataca 31920 gggtgcaggt acagaagctg cccccaggtg atgagctctc gtggccagaa ccaccagctc 31980 tagggaccag cccttgcgcg tatgtgcatc agccttcgtg tgtgctgttc cctatgtctg 32040 gaatggccgt cctctcccaa accagctgca tttctcctca gggatgcctc tgcctacacc 32100 actccttccc gcaccccacc cgacccccaa cgcccttcac cccagtcacc ctatggcaat 32160 gatttattca tgtctgtctt cccttcccag gccatgaacc ttgtgrggca gggactgtgt 32220 tctacgcatt tcttcttgaa cccctttacc atttttgtgc ctacggactc ccagagtgct 32280 aaatcactcc caacagcccc gcctatgcct ctgccgggac cttttccagg ggcagagagc 32340 tggaagcact tggaaatttt tctctcccac atcctcacat gccaccaccc tcccactccc 32400 ccagcccgcc cccaggcctt aaccaacggt ggacaaatat gaaggtgtca gtaccccagc 32460 cctccatgag acttagcttg gttccactca tgtgcttggg tcccactttc ccactccctt 32520 tccactcctc cccaccctca ttactttttt ttttttttaa gacagggtct cactctgtca 32580 cccaggctaa agtgcagtgg cacaatcata actcattgca gtctcaacct cctgggctca 32640 agtggtcctc ctgcctcagc cttctgagta gctggtacta tagatgcact ccactcactg 32700 ggctaatttt ttaatttctt gcagaaatga tgtcttgcca tgttgcccag gctggtctgg 32760 aactcctgga ctcaagcaat cttcctgcct tggcctccca aagcactggg attacaggtg 32820 tgagccatca tgcccagtcc cctcattact tttatttatt tatttattta tttattcaat 32880 ttttgagacg gagtctccct ctcgttgccc agactggaat gcagtggtgt gatctcagcc 32940 cactgcaatc tccgcctcct gagttcaagc gattctcctg cctctgcttc ctgagtagct 33000 gggattacag gcatgcgcca ctatgcccag ctaatttttg tatttttagt agaaacaggg 33060 tttcaccatg ttggccaggc tggtctcaaa ctcctgacct caggtgatct gcccgccttg 33120 gcctcccaaa gtgtcgagat tacaggcatg agccactgtg cctggcctat ttatttttga 33180 gacagttctc actctgttgc ccaggctgga gtacagtggc acgatcacag ctcactgaag 33240 cctggaccta agcgatcctc ccacctaagc ctcccaagta gctggatcac aggcgcatgc 33300 caccacgtct ggctaatttt ttttgtagag attgggtctt actatattgc ccaacctggt 33360 ctcaaactcc tgagctcaag aaaccctcct gcctccgcct ctcaaagtgt tgggattata 33420 ggcgtgagcc accctgccca acttctcatt agttttaaat aaatctcttt tacttgaatc 33480 tttgtctcag ggcctgcttc tggggaatcc aacctaggat gcaaagtatt tgctacacac 33540 tattgcaact actttctact gcgcatgtgc catagggcac tgttggtaaa tgctctacag 33600 cttaagctct cgtttaattt gcataacaat gctatcatga tcatttcaca gaagacagaa 33660 acaggcctag agaggtacag tgacccatgc aaggtcacac aggggacaaa tggcagaact 33720 gggatttcaa tttaggtctg tgctatgcta acaacactga ttttaaccac tacatcatcc 33780 cagctctttt tttttttttt tttttttttt tgagacggag tcttgctctt ttcacccagg 33840 ctagagtgca atggcacgat cttggctcac tgcaacctcc gcctcctggg ttaaagcaat 33900 tctcctgcct cagcctccca catggctggg attacaggca cccgccacca tacctggcta 33960 atttttgtag ttttttttta gtagacacgg ggtttcacca tgttggccag gctggtcttg 34020 aactcccgac ctcgtgatcc accagccttg gcttcccaaa gtgctgggat tacaagcata 34080 agccaccgcg cctggcccat cccagctctt tattcatctg tgtaaccctg acagagaata 34140 cagtgcctgg gccgtcatgc acacttaatg tgtgttttgt gaaaggctaa attatttaat 34200 gaagggccca attaacaaag agtagatcgg aatgattgga gtaaaataac ccgaagaaga 34260 gagagacatg ttggagagac aggtcggggg aaaattaggg aagatcttgg tgccaagtgc 34320 aggagctcat atctgaaagt ctctctcctc tattagaact gtgcctgggc ctgggcaaca 34380 taacaagacc ctgtctctga acaaacaaaa taagttagct gaacatggta gggcgcacct 34440 gtaatcccag ctattccaga ggctgaggtg gaagattgct tgagctcagg aggtcaaagc 34500 cagcctgggc aacacagcaa gaccccatct ctaaaaaaaa aaaaaattaa aattaaaaaa 34560 gggccaggca cagtggctca cacctgtaat cctagcactt tgggaggcca aggcaggagg 34620 atcgcttgag ctcaggagtt tgataccagt gtgggcaaca tagtgtgacc tcacctctac 34680 aaaaaaaatg tttaacattt ggccaggttg ccaggcgcag tggctcacgc ttgtaatccc 34740 agcactttgg gaggccgagg tgggcggatc gcgaggtcag gagatcgaga ccacggtgaa 34800 accccgtctc tactaaaaat acaaaaaaaa ttagccggga gaggtggcgg gcgcctgtag 34860 tcccagctac tcgggaggct gaggtaggag aatggcgtga acccgggagg cggacgttgc 34920 agtgagccga ggtcgcacca ctgcactcca gcctggacga cagagtgaga ctccatctca 34980 aaaaaaaaaa caaacaatta gccaggccat ggtagtgcat gcctgtagtc ccagctactc 35040 agcaggaaga tcacctgagc atgagaggtt gaagctggag tgagatatga ttgcaccact 35100 gcactccagc ttggatgaca gagctgtctc agaaaaaaaa aaaaaattgt gcctagggtg 35160 gggagaaaca catacatctc tgggtatact gtggcaggaa gctaaggata gaaaggaaga 35220 aggaggtctg gacccctcaa actgaccctc aagccaataa cgtggaatta gttaggagga 35280 aaaaaaatta attaattaat taatttttta ttttttgaga caggttcttg ctctgtcgcc 35340 caggctggac agtgcagagg tgcagttaca gctcactgca gccttgacct cctgggctca 35400 agggatcctc ctaccacagc gtcctgagta gctgggacca caggcatgtg ccaccatgtc 35460 cagctaagag aaattcttaa agaagagaga aaggagggaa aggaactgag cccctgatgt 35520 tgtctaggga aaaagctggg gctctttaca gcatgctgcc ttctttaatt ccacagcact 35580 atgtgggttt ccatgccttt atttccttgg aaagtatgag cttcttgaag acagcaactg 35640 tgccttgtct ttctttgtat cccttccttc tctcctagta cccagcctag aaggcactca 35700 ataaagcaaa tgattagccc catttcacag acgaggaacc aacactgaga gaggtaactc 35760 acctgtgcaa gtcatatcac aagtgccaaa gtcaagactg gatggaggac tgcctggctg 35820 caaaccaatt cttcccaggc tgacatggca ggtaggtgag tgggaaagag aagggggagg 35880 cataaggcaa ttggagattt tagtacctat taataggcag tggattttgg cactcaaaca 35940 ggctgtcttc attagctggg gaggagactg agtgggcctg gatggtatgg aggtatttgc 36000 acagggaaac ccattgtgct ggcttatcca ttcagattag acaatgctgg ttcctctcta 36060 cctgccttgg ctaagctcac ctaggagtaa atgccccagg gacaccgtca cgtctatgtc 36120 aacacagagt cacggaatta aataacagaa taggatcaca gatttacaga acaatagccc 36180 agaaccttgg acatgacaga tagttattaa atgcttggcc aatgaaaaaa aaagaaccta 36240 gaactagtat cacggtaaaa tctaattata caagctaagt taccttgaga aagcaccagg 36300 cacagccctg agccttgggc agcgaatgat gtctttggac tagacagaag agatggttgt 36360 ctgccctgct ttgaagctct ctggccaggg aaactccaaa ccattcattt gttcatccat 36420 ttgcccacac aatcaacatt cattgagcat ctgctctgtg gggtgctatg tgatggtgac 36480 agtcccagga agcagtttca gtcctccctg ccctcaaggg gctctgtgtt tagggaggac 36540 aaacatatac atcatgacaa taaaatttga taaaaagtta aattagagag ggggcaaacc 36600 ctgacgtagg agcccaggag ggactcctaa cttctctgcc aatttcatgt ttcaaaatta 36660 ttagacccaa gactcgcttt ggtataaact taagctctct tactgtacat ttttcttctc 36720 ctgctaatct ttaattaagt gcccaccagg tgcctggcac tgaatcaaaa ctcaaaaaac 36780 ttgctgaatt aagccaaatg cacctcctgt gggtttttcc ccctaatatc ctcagaggca 36840 gtaatcaatt ccctccccaa attccccagt ccccacccca tccccacctt tccttttgca 36900 gtataatctc aactcctgtg atgggggcag gcaggatagc atggtagtga gatcaacaga 36960 ctgtgacttg ggttcttgtc gctacttagc cattcattct gtatgtgacc ttgggatggg 37020 actttagggg gatttcttta ataacctcta aactcccgtg caatgctgag agccaaggta 37080 gcggctctca ggtcttgttc caagaactgc caccagaggg cagcctagag actcctctca 37140 ggtgttttcc tccagagcct ttgctctttc cctcactaat gtcatccact ccctgggtcc 37200 accatcaagg cacacaggtg tccctttagc cgtcaaggtg accgttctaa ggtgagccag 37260 gcatgtgagg cgaggcgaga caggctctga agccctcagg aaaattcagg cacaggctgc 37320 ttgtccaagt ggacctcacg atcatcattt acacattctc tccctgatta tttcatgagc 37380 cagcggtcta caggagagga taccacagcc agtcaaaggg atcagggccc tgccctcagg 37440 gagctgcctt ccagtgaggg aggagagaga tacacagatg cttacaaggt atctgtagta 37500 ggatgtcacc tagtgatgaa tggtgtgacg cagcttaggc agagcgagga aatagggatg 37560 gtcccaaacg tgtgtgtgta tgtgtgtttt tttgagacag agtttcgctc ttgtcgccca 37620 ggctggagtg cagtggtgcg atctcagctc actgcaacct ctacctcccg ggttcaagtg 37680 attctcctgt ctcagcctcc cgggtagctg ggattacaag tgttcaccac cacacccggc 37740 taatttttgt atttttagta gagatggggt ttcatcatgt tggccaggct gggctacgag 37800 cgaaactcca tctcaaaata ataataataa taataataat aataataata ataataataa 37860 taataattcc tgtctcctag ggtcattgag agaatggaga tcagtcctgc acgggcagac 37920 ctcggcaccg agctgaatgt tacacactgc caaggagtcg gcgccggtct ctggtgaaga 37980 tgttctagtc ccagggccca agagaacaga gcgggacacg agatgcctct ctaacaggag 38040 tctgtgtttc ctggtcacct ctgtgtgctg ggtgatgcct caggtgccag ggagacactg 38100 ggcagtagga gaagctccct cctgtgagga ctctcaccca gaggcactca gacagcccag 38160 aacacaggag ggaagatggg ccaccacagt cccattccta ggcgctccaa gggccagccc 38220 agcctagaag tccctcctgg tacaagtctt tccggcctct cctctcccga gagcacgttc 38280 agactgcacc ctggcgctct ctttcaccct gtcgaggaag ctgtgactct aattctgggt 38340 gaccagtaac aataaaaacc actaatagcc ctagtggaaa gaatgactat ttaacaggtg 38400 cacataatga gcagacaagg agagctcttt tccctcacac tcactccgca ggacagagag 38460 gggaacagtc ttgctactgt gacagctaat cacagcctct gtgtgcctcc ctcctcgcaa 38520 gataagaggg gtggcctcag ggtgtgacgg aaacagccag gaaggcttcg gccagggagc 38580 gactgcccta cagctgactc cctgctgctt ttgggaagtg ggggtgaagg gaaaagactc 38640 aggggagcgg ggggaagaaa gcttggtggg cagaacccca cttgccaaat tcctcctctt 38700 tcaggattca gtggccaaag gtgagcctgg ctctggcccc agtcttcact cccaagtgcc 38760 ggctgcagtg ccctgggccc gtcggaggct ggcagggctg ctgggagggt gaggacaggc 38820 atgggaagag tggaaagtgg gggctgaccc agccaagctc caggaggcca ttccatagag 38880 gagagagcca ttggaccagg agccaagcag ggaagaagga aggaaaggta gagccgagca 38940 ccgcccacac caggaataat ctcaggaaga ggatggtgtg gacaggcccc aggacagggc 39000 ttctcagcta ggggcaactc gttatgctgc agtgggggca cctaacagaa ccccctaggg 39060 tttttttaaa gtatacagca cggtcctgcc caccactggc atcagatgaa tgccatgtcc 39120 aggggaaacc ccagtacgtc ctcatcacag ccatgtgaac actgctgttt gttcataaaa 39180 atgtgttgca tccctagggg gaaataaaag gagctaatcc aatagagaga agagtggctc 39240 aggggagaca gtcacccgcg aggaatgagg gcaggacaca caggcccagg ccgagattgc 39300 acagggtgtt gcatgagtgg ctgtgtcaga gccaagtgca catgcccact gccacctccg 39360 ccaaggccat cagtgaatcc ctccagcacc tacaggggct ttgcacataa gagctgtgct 39420 tgcagggggt ggggggtggg cacagggcag gaaaccagag aggagggcac catgtccctg 39480 agtaaggtag aaaggggaag cacagcctgg acacattctt ccattctttt atttcaaaat 39540 atcaccaagt gcctgccaca gctatgtgct gggaactcag tgaacgcacc tgtgctttgc 39600 ccactggcat ggagcccacg gtccgggggt gatggtttct gttgcatttc aaagccactg 39660 gggcgctgca tagatggctg gctgcagagt ctatttgaga cgaaggcaag aagtcagggc 39720 ccatggactg tagattccac ccttgatcgt ctactacagg aaagaagcca cagaagtgag 39780 ggggtattcc caacaggaag gcagagaagg agagattaaa actgagtcaa gaagtgtagc 39840 cagtcccatg ggagcacctg ggctgaggag ggtcagtccc cgcagagatg ttcggaacac 39900 ctgattgggg atgtgccctc tccttctatg ttcttggtca ccagaggaac caggtgaaga 39960 gtgtggatga ctcagtccac ccctccccgc acaggaaaaa caactcagag gacatcccct 40020 actgggagtg ggccaggagc gctggcttcc acgccaagga gaggagggtg ttgctgtggc 40080 tgcagaggtg agtccacagg gaagcgtggg tgtcatggag aaagagagag gagaggaaat 40140 ggcctctcgc tggagccaag cagagaggaa caaaggaggg gaggagccca caggccctgg 40200 agccccagag cagctctggg tggccccttt ccttctctcc tcagcacaaa ataagctgct 40260 tttccaggct ttttctcttc ctcgtgctca ctccattttc cttcttccaa gcctaagccc 40320 tggtctacca aggaccccta tgtgaaacac ttgtgctaga acttctgtcc tgactctcag 40380 gcccccagag gaggaaagct gtggaggagg gacctcatga gccagtaacc ttcacttagg 40440 aactcttcac tttttttttt ttgagacgga gtctcactct gtcacccagg ctggagtgca 40500 gtggcatgaa atcggctcac tgcagcctcc gcctcgctgg ttcaaatgat tctcctgcct 40560 cagcctcctg agtagctggg attacaggtg cctgcccccg cacccggcta atttttgtat 40620 ttttagtaga gatggttttg ccatgttggc caggctggtc tcaaactcct gagctcaagt 40680 gatccacctg ccttggcctc ccagagtgct gggattacag gcgtgagcca ccatacctgg 40740 ccagctttga cagcagaaat cttctctcta gaaatattca tatacataag gaattttgca 40800 cacatcctta cagggtttag ggagcctctt aagcctgtcc atgaacctct gccattcatt 40860 tatagcatgg tgtcagctgc agagtaatag gatctatgtg ggcctggaag tccaacaggc 40920 ctgggtttga atctttttca aacatttttg cttgttagca gctggtttgg acaaataatt 40980 tcaacttagt tttctcatct gtgaaacagg gggaaaaagc acttaccctg tagggctgtt 41040 gtgaagattg aacaagacaa ctcatgttaa atacccaggc accgtgtccg cctgcagcag 41100 gcagtcagta catggtggat ctcgtgcccc aagcctcctc tttatggagt ccctcaagtt 41160 gccccagctg gcctcggatt cctcccacct ccacctcctg ggtagctggg actacagcac 41220 gccatgcttc agcccccttt gtgatcattc aaagccgttt ggttccctgc accctctcac 41280 atttactgca attcctatag aatccaaatt cttttaacct ggtgttgtag gtcatctaca 41340 atcccgatcc aagcagcctg tcccttctcc cacaccactg tcctccacat gcatgtccca 41400 caccaaccca cgccatccgc tggttctaaa tacacctgaa gcaccacaag gccccacgtc 41460 tggctgcctg ctctgtgctt ctctcatcat tcctcaaggc ccggggaaag tcctgccacc 41520 tccctgaagc caagtgccta cccgcttctc cactctaggc tgctctctta cgctgtgtgt 41580 ccccagctgg atggcagctc ctcagggcaa ggaacaattt ctcttctttc tcagtgcaac 41640 ttccaaaaca cagcagagga gcgtgcactg ggggcaaagg ctggcacagc aaacacagca 41700 gcacagggct gctggagggg tggaggggac cggaggggac agtgacctgc ctgcttcata 41760 atacatggct gggagcccca ccttttgcct atggaagcca tgtgctttga gcttgatttt 41820 ctcgggtgta gagtgtctac gccacccaca ctttctcacc cggtccaatg cacgggcacg 41880 ggcaggggca gggccagaat ctgttgctgt agaacacagc agagctgcct gcagtactgt 41940 ggacgtcacc aaactggcag cttcactgaa ggtgaccctg gcctgtgtgg gcactcagct 42000 gatgcccttc tcttccctcc taagccccat tagctttagg tcttcaccac ctgcactgag 42060 agaaagatta atggtgagca ccagagtaaa atctagcaca gaaggcaggg tttcagtaaa 42120 aaaccaaaag catggcagat tagagctcaa gggcccacag gtgggcagga cagtagagga 42180 actgggcagc ccagggtggc aggatggggt ggacaggagt ggacaggaga gagtagccag 42240 gaggagggac gtggctgcac caccagtcca cgtgggcggc tggagacctg gcaggaggcc 42300 aggagagact gtgcctcctc cagcctggct gtcttggcca agttctgatg gcagccagga 42360 ggactgtgag aaagaaaaca agacaaactt tgctacctct ggctggtcag gggaagtttg 42420 tgttggaaaa gaagaagcca cagattccac tgttccgctg ccttctcccc tcacaccctg 42480 ggacggccca gcaccttcta gtccactcct ccctgccctc tgccactccc ctcactaccc 42540 cagtgccccc tccctccttg cccaggtcct ccagggagcc aagcctgaga gtctagggag 42600 gcccacaaga ggaaatgatg ttcggaaaga gcagcatcac tttatttttt gagatggagt 42660 ttcccttttg tcacccaggc tggagtgcaa tggtgcaatc ttggctcact gcaacctctg 42720 cctccagggt tcaagtgatt ctcctgcctc agcctcccga gtagcaggga ttacaggcgc 42780 ccaccaccac gcctggctaa tttttttgta tttttggtag agacggggtt tcgccatgtt 42840 ggccaggctg atctcgaact cctggcctca agtgatccgc ccgcatcggc ctcccatagt 42900 gctgggatta caagcgtgag ccaccgtgcc cagccgagca gcatcacttt aaatggtatc 42960 tctgtctcac atttgtgcct gcacctctat acaaggcagg aagggagcag aagacactct 43020 gctagcattg gctgggctgg agactcagga aagggagcct ggggctgtgg gggtgaagct 43080 ggcctatttg tctcccccag tggaaggatg aagacaggag cacctgcagg gctccatttg 43140 ctgggcttcc tactggggga ctgcacccct tgtcccattt aacctttttg ataatttccc 43200 atttgatgtt cacaaaaacc ttgcgaaata agtgttattt aaaatacccc catttcacag 43260 atgaggagat ggaagttcag aaaggtttga agagccgccc aaagccaaga ggttagctgg 43320 gacttgaacc cacgtcatct tcattctgaa gccagtgttt tttgtttgtt tgtttttccc 43380 aaggcattag cccccttcag gggaccctgt cctatccctt cctctctcca cagctccctc 43440 agcccctctt tcctcccact cccattctga gtgtcactag caagtcaaga gcaagactag 43500 atggcagcca ctagctgcac gtggccaacc acatcattac aattaattaa aattaaagct 43560 ccaaatcttt ggtggttaaa cttgctccat gtggccagtg gctaccgtat cggacagcac 43620 agacaggaaa catttccatc ctcctgctgg tctgaactta aagggcggtc aggaggtaga 43680 gggaatggga gagaccagcc caaggggccc agggacagag ctgcctttgt gaggagatga 43740 ggtttgggaa gtgagtttgg gtgaggggag caggggtgtc tgaggcctgg gggaggagag 43800 ggcagcttgc tggacttgca ggcacctccc tccgatactg atactcccgc aatactagtg 43860 agcaattaat tggctggggc ggggcagggg ccagagcagg agggaggcca acgcaatgga 43920 ggagaagcca ccaccctctc agcagatggc agggctccgg ttctcctggc ctcctttcat 43980 cctttgagga cccgggagcc tgcctggccc tcctcgacct gcccagagct ggctgctgct 44040 tcctgctcgc tcatccagcc cagggccagc agatgggcta cggggccacc accttcccaa 44100 ttagccttct tagcctcttg ctcacccttg aggtctgggt taccagccat ggggaggggc 44160 tgtgagtggc ctgtgagtaa ccaaggccca gctttgacag aggtcatcag cccaggcccc 44220 tcccctggct gagccctggc tcccagcacc cccccccacc cccgccacca tgccgtcccc 44280 agaggccccc agaagcgaca ggtcagggag ttggttttgg gaatgtggtg ccccatttct 44340 gagaaggaga aggttcccca ggtgatggaa cacgtaatag gtatgtggct ggcctggtac 44400 ttggggagtg gtgggagggg gagcctgggg gatgggggca gaactcatat ttgaaggagg 44460 aaagtggagg tggagtgatg gggtcctaag gaaggagcag atcaggggct gtgaatggga 44520 cctcattggg gttgggtggg tggggctgga gcgggcaccc aagtgtggaa gaaaaagctg 44580 aaacacccac gactgcgagt ggagggcaga tggagagaca ggccaagcca cggtaggcag 44640 gagagttaag gagccaggca gctgggtccc gtggcaagag tggccgcccc agagtgggtg 44700 gccgtggggc agagcgcctg gttccgggtt aggcaatgag gagccggggc caggcctgtc 44760 aggtggcagg atcgttagag ccccgtggcc atgggtaccc cacactgcag ccactgctgc 44820 tgctgagtag gcagatgcac cgggctgatt accacgctcc tcccggccac accaacttcc 44880 cccggggcac ccaccccctc cacctctcct cctctcccca cagtgactcc tgcccaggga 44940 atgtccagct ctggcataaa ggacccaggt gtcctcgagc tgccatcagt caggaggccg 45000 tgcagcccga gatgggctcg tctcgggcac cctggatggg gcgtgtgggt gggcacggga 45060 tgatggcact gctgctggct ggtctcctcc tgccaggtag gaggctgggg gccctgggaa 45120 caggagggag gcgggaggga gactccggga gaggacccag cgaaggggac gggcaggggc 45180 tctggaatct gccttttgag tctgggggtt gctcctcact gtatggtcgc ctcaggtaag 45240 tttcttaaac ttcctgagcc ccagtttctg aaattctgaa gtggggttaa tgacacctac 45300 ctctagtctg tgtgtctcaa attaaataat gtatgtgata tgtactttgg aaattctaga 45360 ggtttatata aatggtggtg gtgattttta ttatgggagc actacaagat aatgattgga 45420 catttaatag taataatatc atttttagag cctttttata tgctagactc tgttttaagc 45480 acatttggat tatatattag aacttttatt tttatttttt ttgtgagatg gagtcccact 45540 ctgtctccaa ggctggagtg cagtggcgta atctcggctc actgcaactt ccacctctca 45600 ggttcaagcg actctcatgc ctcagcctct agagtagctg ggacaacagg tgcccatcac 45660 cacacctggc taattttctt ttttttgtat ttttagtaga aacagggttt taccattttg 45720 gtcaagctgg tcttgaactc ctgactcaag tgatccgctc gcctcggcct cccaaggtgc 45780 tgggattaca ggcatgagcc accacacccg gcctatatta gcacttttga tcattacaag 45840 aacggtatga aaagagattt gctatttcca ctctacagat gaggacactg aggctcggag 45900 aggttaggaa actagctcaa aatcatgcat tagaaggcag caaagccaag atttcaaccc 45960 caggccaggc aacccctgga cctgtgttgt tgaccaccgg gtacttatag cccttgagga 46020 atttctgcga ccttcccatg gtctagtggg tggttggtgt ctgagggaat agcgaaagag 46080 agaggcaatg catggtggat tcgtgcagag gactgaaggg aattggcaca gctggggttc 46140 ggcgtggagg tgcatgcaga gaatttcttt ctgaggagag aacagggaca tcacagagga 46200 tggcagtctg gttgttggtg gagggatcag gatgagtggc agtaataatt cataatatat 46260 aatgctttac actttctaaa acatctggcc gcacatgata gcttgtgcct gtaatcccaa 46320 cacttcagga ggccaaggca ggtgaatcgc ctgaggtcag gagttcaaga ccagcctggc 46380 caagatggtg aaaccccctc tctactaaaa atacaaaaaa ttagctgggt gtggtggcgg 46440 gcacctgtgg tcccagctac ttgggaggct gaggcaggag aatcgcttgc accaaggagg 46500 cagaggttac agtgagctga gaccgtgtta ttgcacttta gcctgggcaa caagaaactc 46560 catctcacaa aaaaaaaaaa aaaaaaaaaa aagaagaaaa aacttccagg tggatgatct 46620 catttagttt tcttcatagt aatgctgtgg gaaggcaggg aaaatttggc ccctctgaat 46680 gtataaacta aagctcagag aggttcagta acttgctagt atgtggctct gtttgtaaca 46740 cgtgggacct ggaggggcta gggaaggcag aaggaacgca ggtgaaagag tcatggagga 46800 accatggggt aagttgggcc tggggttttg agcaaaggaa aggaaagata aggaaagatg 46860 tggctccaca tccctgaggg aagtcaaggc agcagaagtc agatgagggg ctggacagag 46920 gcaggtgtgc tcagagaggg aagctgattg tggccaggag cctcggaggt tcgtggggtt 46980 tcgtcctggt tccctgggct gggccagcga gagcagggct ggctcagggt gcggtgtcct 47040 gacacactgg taccagcagg ttctgaagca acaggtagtg accccacatc ctggccccca 47100 cccagcttta ctggcatggc cagtgctgag ataggaaata gggtttccat tcctgacccc 47160 agcctgggct ctcacgaaga agctggtgac caaatcttag tcctcgagtg ccctttcctt 47220 tatttcagcc cctctgcccc cagctttgtc tttttccagt gtctccttct atatgtgtct 47280 ccacttctca gccctccatt gttttgcctt ttgtcttctt ccctctggtc ccactgtctg 47340 gcccaggatt tttcccctaa gaatttacgc ctggactcct cagagcctca gtttccccaa 47400 ttctctgtct cttcagggtc ctttctttta gacctatttg ttcctgcccc ttctccattc 47460 cctcttcttt ttaaaaaaaa ttttaattaa aaaacaaaat acagatgggg tctatgttgc 47520 ccaggctggt cttgaactct ggggcgcatg caatcctccc acctcagcct cccaaagtgc 47580 tgggattacc ggcgtgagcc actgtgccca gccccctctt atattcaatg tattcctttg 47640 aggtcactca ctttggcacg taattttcta tttttctggt tggtgtttgc ccacccttcc 47700 caaacaaaga aatgccttta ttcggccacc tcaatatcct ttagagacaa tagccagttc 47760 ttcctccttt ctccatccct aaactctccc tgcgctctgc ttgggagaaa cccgagaggc 47820 cgattactga gataaggcag aaaggtgagg gaggaagcca agcctctttg gcccttacta 47880 accactgctt tcctccacag ggaccttggc taagagcatt ggcaccttct cagacccctg 47940 taaggacccc acgcgtatca cctcccctaa cgacccctgc ctcactggga agggtgactc 48000 cagcggcttc agtagctaca gtggctccag cagttctggc agctccattt ccagtgccag 48060 aagctctggt ggtggctcca gtggtagctc cagcggatcc agcattgccc agggtggttc 48120 tgcaggatct tttaagccag gaacggggta ttcccaggtc agctactcct ccggatctgg 48180 ctctagtcta caaggtgcat ccggttcctc ccagctgggg agcagcagct ctcactcggg 48240 aagcagcggc tctcactcgg gaagcagcag ctctcattcg agcagcagca gcagctttca 48300 gttcagcagc agcagcttcc aagtagggaa tggctctgct ctgccaacca atgacaactc 48360 ttaccgcgga atactaaacc cttcccagcc tggacaaagc tcttcctctt cccaaacctc 48420 tggggtatcc agcagtggcc aaagcgtcag ctccaaccag cgtccctgta gttcggacat 48480 ccccgactct ccctgcagtg gagggcccat cgtctcgcac tctggcccct acatccccag 48540 ctcccactct gtgtcagggg gtcagaggcc tgtggtggtg gtggtggacc agcacggttc 48600 tggtgcccct ggagtggttc aaggtccccc ctgtagcaat ggtggccttc caggcaagcc 48660 ctgtccccca atcacctctg tagacaaatc ctatggtggc tacgaggtgg tgggtggctc 48720 ctctgacagt tatctggttc caggcatgac ctacagtaag ggtaaaatct atcctgtggg 48780 ctacttcacc aaagagaacc ctgtgaaagg ctctccaggg gtcccttcct ttgcagctgg 48840 gccccccatc tctgagggca aatacttctc cagcaacccc atcatcccca gccagtcggc 48900 agcttcctcg gccattgcgt tccagccagt ggggactggt ggggtccagc tctgtggagg 48960 cggctccacg ggctccaagg gaccctgctc tccctccagt tctcgagtcc ccagcagttc 49020 tagcatttcc agcagctccg gttcacccta ccatccctgc ggcagtgctt cccagagccc 49080 ctgctcccca ccaggcaccg gctccttcag cagcagctcc agttcccaat cgagtggcaa 49140 aatcatcctt cagccttgtg gcagcaagtc cagctcttct ggtcaccctt gcatgtctgt 49200 ctcctccttg acactgactg ggggccccga tggctctccc catcctgatc cctccgctgg 49260 tgccaagccc tgtggctcca gcagtgctgg aaagatcccc tgccgctcca tccgggatat 49320 cctagcccaa gtgaagcctc tggggcccca gctagctgac cctgaagttt tcctacccca 49380 aggagagtta ctcgacagtc cataagtcaa ctgttgtgtg tgtgcatgcc ttgggcacaa 49440 acaagcacat acactatatc ccatatggga gaaggccagt gcccaggcat agggttagct 49500 cagtttccct ccttcccaaa agagtggttc tgctttctct actaccctaa ggttgcagac 49560 tctctcttat caccccttcc tccttcctct tctcaaaatg gtagattcaa agctcctctc 49620 ttgattctct cctactgttt aaattcccat tccaccacag tgcccctcag ccagatcacc 49680 accccttaca attccctcta ctgtgttgaa atggtccatt gagtaacacc cccatcacct 49740 tctcaactgg gaaacccctg aaatgctctc agagcacctc tgacgcctga agaagttata 49800 ccttcctctt cccctttacc aaataaagca aagtcaaacc atcatctgga aacagtggcc 49860 acttttcact gacctctctt cgacatctag tcaacccacc caatatgcca ctgggctttc 49920 gctcccaatt ccaccccacc ctccattaca gagctcacca cgccctccta gatcaccgtc 49980 cccaacacac ccattgcctc tcaaggccct tatctcagcc ccttcctgtg gccatttccc 50040 tcagtgccca gatgattccc tgggtgaggg agacactggg gcaccctcag aggttggagc 50100 aggctccctg ctgtccctgg atcctggaca gatggctcag taaactgtgg ggactaggtg 50160 cagacttttt gccttcttgg agtcctgggt ctcctctgag agtctgggtg gtgctcttcc 50220 tacgcctcta gaggtctctg tgtccctcat tttccttcaa aagcgggctg tgtttctctt 50280 ctaccttcca gctcctccca cagaggagga agacaataaa tatttgttga actgaaagca 50340 gagattgcct ggcctcccag atccttccgc catttccctc ctctctcatt gctccaggaa 50400 atccattctc ttcccattcc tcattcaccg tggggtcccc cttcccctta tttagggccc 50460 tcagtgtttt ctctccctcc cctcccctcc cctccccacc caaactcctt ttcttccacc 50520 attagcattc ctcaccttct agatgccatc ctctctggga gtcatgagtc tcgatttcct 50580 gggtttctgg gacacctgga agcttgggaa ggctgggaca caacaactcc aaccagattc 50640 ctgtcagctg agtaggaggc cagttgggcg ttgttcctgg agctgggggt ggagagagta 50700 aaggactgag aggatgggag cggggcaggg agtgcagcca agcagggtga ctcactggcc 50760 tagatcaaga ggcccagcct gtggcagaac agagctgcca gtggtctctc catcttcaca 50820 ctccctgctc tgctggggtc cagagtgaga gtgtgagcaa catggctctc aggtgagggc 50880 tgagaaggca gagtgcccca gtgggaaaga ggagtcgctt ccactggaga agagagagaa 50940 agtggagtgt gtggtggggt ccatgcgact taagtcctga gacaggcagg gagaggctga 51000 ggcggacgaa gttcccgcat cccaaggagg gcagagtgga ttgtgcttgt ccctgtagga 51060 gccccacccc ccaccccagg ccacctctca gagcctctgc ttggctgcaa aggaattcac 51120 ccctactgta gcacttaacc cattccctcc tatcagggtg gtgctgtctg gtcctgaatt 51180 tagaactgtt gaaactccaa gtctggaatc agcaaaaatg tattacattg accagaaagg 51240 gattgaatca cccttggtcc agcatctggc ccctgatctg cagccaatgg caggaatcga 51300 ggtcctcaga tgcttcatga atgggaattg cagggagaga aggctctctg atgtggtgtt 51360 tcctcgagtc tcctgctgtg ctccaaatta aaagcttgtg taaaactcat gcatgtcatc 51420 caaaaaggcc tctgggctcc atccactgcc agttctggag aggagctctt cactcctcca 51480 gtggttaagc cagcaggggc aggtggggag gacacagcag tagaatcagc caacagctca 51540 tgtttagacc ttgggcagcc agggaagcct actcctgggg cctcccggaa gccatggaga 51600 gaacaaagcc attgcatttt tataataaaa tttgcaaaca tatttaaaag ccaacaaact 51660 gttaatgaat ctctacattc tcatcgccca gcttcaacaa ggatcccagc ttcaacaagg 51720 atcaagtcct ggccatttga cagcagcatt taaaggctct cctctactgt tacttggaaa 51780 tagccacttt ctcccaaggt ttcttatact ctgtggcaca tctgaccacc agtagcaggc 51840 agaatgatgt cttcaacccc aacaccatca aagatgtcca catcctaatc cctggaacgt 51900 aggaattagg ttacatggca aagggaaatt aaggttccag atgggattaa ggttgctatt 51960 cggctgactt cacagagatt atcatggatt attcaggtgg gtccagtgta gtcaccaggt 52020 cccttaatgt ggacatggga ggcagaagag gaagtctgag tgatacagtg taagaaatgg 52080 ctgattttgg ctttggagat ggaggaaggg gaccatgagc caaagaacac aggatgcctc 52140 tagaaggtga aaaagcaggg aaagggattt tcccctgagg cccccagaaa gaatcacagc 52200 cctgctgaca cctttatttt aatccactga gacctgtttt agacttctga tctccaaaac 52260 tgtaaagtaa taaatccatg ttgttgtaag ccattcggtt catggtaatt tgtcactgca 52320 gcagcaggaa ttagtcagta tctcataagg atggcatcca ggtccatttc cctagctaga 52380 tccagggtct catgtaggag cagctcctca gatggggcca cttctgcacc ccagaacctc 52440 ctgcaggttg gggccaaggt gaaggagata tgaggatgca tgagaaaggg gtgctgggag 52500 gaaacaatcc agctcccaaa aagaaacaag tgtttctgtt gctgagagag gcaattaaga 52560 gagtgggacc ccagggtgga ggtccttgtg tatagagaag cagggctggg gaggctggca 52620 accagggatg agctgtgagc caggacacct gggccaagaa ggggcaggga ggtcaaggaa 52680 aggagccagg gcgggagaca cccagcttcc tctgggacat tcattcaagt gacacctgtt 52740 gccacagacc acattaggaa tgagggtgga atgtggaggt ttattgtctt cacaaccact 52800 agcccagcct gtttctgctg tcccccaccc cactaccagg ataaagggct ggctgtcttg 52860 gggctgaggg agatcgggtg ctgagcagga tgcagggccg cgtggcaggg agctgcgctc 52920 ctctgggcct gctcctggtc tgtcttcatc tcccaggtat ggaggccgtg atgcccttgg 52980 gcaggaggga ctggaggtcc cccaggaaac aggaattaag gaaaggggta aaggcaggag 53040 ggtacacatt taggtccctg agggaaaagg aagaataggc ataggggaag caaagggaac 53100 tggggactcg gggactggag accactggtt gctttatctt ccctttccct caggcctctt 53160 tgcccggagc atcggtgttg tggaggagaa agtttcccaa aacttgggga ccaacttgcc 53220 tcagctcgga caaccttcct ccactggccc ctctaactct gaacatccgc agcccgctct 53280 ggaccctagg tctaatgact tggcaagggt tcctctgaag ctcagcgtgc ctgcatcaga 53340 tggcttccca cctgcaggag gttctgcagt gcagaggtgg cctccatcgt gggggctgcc 53400 tgccatggat tcctggcccc ctgaggatcc ttggcagatg atggctgctg cggctgagga 53460 ccgcctgggg gaagcgctgc ctgaagaact ctcttacctc tccagtgctg cggccctcgc 53520 tccgggcagt ggccctttgc ctggggagtc ttctcccgat gccacaggcc tctcacccaa 53580 ggcttcactc ctccaccagg actcggagtc cagacgactg ccccgttcta attcactggg 53640 agccggggga aaaatccttt cccaacgccc tccctggtct ctcatccaca gggttctgcc 53700 tgatcacccc tggggtaccc tgaatcccag tgtgtcctgg ggaggtggag gccctgggac 53760 tggttgggga acgaggccca tgccacaccc tgagggaatc tggggtatca ataatcaacc 53820 cccaggtacc agctggggaa atattaatcg gtatccagga ggcagctggg gaaatattaa 53880 tcggtatcca ggaggcagct gggggaatat taatcggtat ccaggaggca gctgggggaa 53940 tattcatcta tacccaggta tcaataaccc atttcctcct ggagtcctcc gccctcctgg 54000 ctcttcttgg aacatcccag ctggcttccc taatcctcca agccctaggt tgcagtgggg 54060 ctagagcacg atagagggaa acccaacatt gggagttaga gtcctgctcc cgccccttgc 54120 tgtgtgggct caatccaggc cctgtcagca tgtttccagc actatcccca cttttcagtg 54180 cctcccctgc tcatctccaa taaaataaaa gcacttatgg aatttgcttc tccttggttt 54240 ctttgtttct gggcataagc tgaagtgagt ctgggcataa gctgaagtga gtctgttcat 54300 tcctgttttc tagccatccc cacggccctc taggggcccc tgcagacgct gtcttgctat 54360 ccccatcctt cacaaaggat cagtgcccaa gtgcttgagg gtggagcctc agtctcaccc 54420 cggccaggtg ggagagctgt tccagaattg tgctggaatc tgaaaggggg aggagggaca 54480 gcaggactaa ttgagatggc acctgcagca gggggcaagg atgaggtccc agaaggcggc 54540 tccagggcca ggtggacagg attccttgca actcacagaa acaggaagcc aaaagtcgca 54600 atgtctacgc ttcacttgtc ttttcttccc cggaaagtca agcttcttgg aagtggaggt 54660 acatcgacct cctcccttac aggcatcatt tagcacattg tgtcccacag aaccacagac 54720 tttgaagagt tgctgagtaa atagcagacc tcgataaagg aaaagagaaa agggagaaag 54780 gaaagggaga aaaaaacctt gaagccaaca atcccacctg gggtggcatt tgatgctttc 54840 attcccaagt gatgacacag tctcagcctt tggtcacagt attgtctctc ctgccctccc 54900 ttcggttttc ccaggagctc aacatcctca cacaggagtt ggagtgacgg cagcgaaggg 54960 tcaggctaca aaagcacgga agaatcagca ggtgtgggtt ggaggtgatt tgggtctgga 55020 ttctctcttc cctgtgccat gcctgcagtg 55050 5 152 PRT Homo sapiens 5 Met Thr Cys Thr Asp Gln Lys Ser His Ser Gln Arg Ala Leu Gly Thr 1 5 10 15 Gln Thr Pro Ala Leu Gln Gly Pro Gln Leu Leu Asn Thr Asp Pro Ser 20 25 30 Ser Lys Glu Thr Arg Pro Pro His Val Asn Pro Asp Arg Leu Cys His 35 40 45 Met Glu Pro Ala Asn His Phe Trp His Ala Gly Asp Leu Gln Ala Met 50 55 60 Ile Ser Lys Glu Phe His Leu Ala Ala Thr Gln Asp Asp Cys Arg Lys 65 70 75 80 Gly Arg Thr Gln Glu Asp Ile Leu Val Pro Ser Ser His Pro Glu Leu 85 90 95 Phe Ala Ser Val Leu Pro Met Ala Pro Glu Glu Ala Ala Arg Leu Gln 100 105 110 Gln Pro Gln Pro Leu Pro Pro Pro Ser Gly Ile His Leu Ser Ala Ser 115 120 125 Arg Thr Leu Ala Pro Thr Leu Leu Tyr Ser Ser Pro Pro Ser His Ser 130 135 140 Pro Phe Gly Leu Ser Ser Leu Ile 145 150 6 63 PRT Homo sapiens 6 Met Thr Cys Thr Asp Gln Lys Ser His Ser Gln Arg Ala Leu Gly Thr 1 5 10 15 Gln Thr Pro Ala Leu Gln Gly Pro Gln Leu Leu Asn Thr Asp Pro Ser 20 25 30 Ser Lys Glu Thr Arg Pro Pro Thr Leu Ile Leu Thr Asp Phe Ala Thr 35 40 45 Trp Ser Gln Gln Thr Ile Ser Gly Met Gln Gly Thr Ser Lys Gln 50 55 60 7 41 PRT Homo sapiens 7 Met Thr Cys Thr Asp Gln Lys Ser His Ser Gln Arg Ala Leu Gly Thr 1 5 10 15 Gln Thr Pro Ala Leu Gln Gly Pro Gln Leu Leu Asn Thr Asp Pro Ser 20 25 30 Ser Lys Glu Thr Arg Pro Pro Pro Arg 35 40 8 46 DNA Homo sapiens 8 cttctagatg ccatcctctc tgggagtcat gagtctcgat ttcctg 46 9 160 DNA Homo sapiens 9 gctttgccca ctggcatgga gcccacggtc cgggggtgat ggtttctgtt gcatttcaaa 60 gccactgggg cgctgcatag atggctggct gcagagtcta tttgagacga aggcaagaag 120 tcagggccca tggactgtag attccaccct tgatcgtcta 160 10 77 DNA Homo sapiens 10 gtgcaagtca tatcacaagt gccaaagtca agactggatg gaggactgcc tggctgcaaa 60 ccaattcttc ccaggct 77 11 31 DNA Homo sapiens 11 gagagctctt tgagagtgac ttttttggtc t 31 12 124 DNA Homo sapiens 12 agaaatggtt tgctggctcc atgtggcaaa gtcggtcagg attaacgtgg gggggacgag 60 tttccttgga gctgggatct gtgttaagga gctggggtcc ttgtaaagct ggggtctgtg 120 tgcc 124 13 421 DNA Homo sapiens 13 gcggaccagg cagggaacaa cctgggagga atcaaatttt attttggaca tgttacttct 60 gaaaggctaa cagacttcca ggcagaaagg tccttgaggg aaacattcta ggggtctctc 120 tgggaggctt agatcaagga gctgagacca aaaggagaat gggagggagg agacgagtac 180 aatagagttg gagccaaggt cctagaggcg gataggtgga ttcctgaggg aggaggaagg 240 ggctgaggtt gctggagcct ggcagcttct tccggagcca ttggcaggac tgatgcaaac 300 agctctgggt gggaagaggg aaccaggata tcctcctgtg tccttccttt tctgcagtca 360 tcctgggtgg ctgccagatg gaattccttg gatatcattg cttggaggtc ccctgcatgc 420 c 421 14 15 PRT Homo sapiens 14 Met Ile Ser Lys Glu Phe His Leu Ala Ala Thr Gln Asp Asp Lys 1 5 10 15 15 15 PRT Homo sapiens 15 Met Ile Ser Lys Glu Phe His Leu Ala Ala Thr Gln Asp Asp Cys 1 5 10 15 16 22 DNA Artificial sequence primer 16 daggtgttcc gaacatctct gc 22 17 22 DNA Artificial sequence primer 17 dacagcctgg acacattctt cc 22 18 21 DNA Artificial sequence Primer 18 daagacagcc tgtttgagtg c 21 19 21 DNA Artificial sequence Primer 19 dtgtatccct tccttctctc c 21 20 21 DNA Artificial sequence Primer 20 daaaggtaag aggtggtgag g 21 21 21 DNA Artificial sequence Primer 21 datctggctc accagaaatg g 21 22 21 DNA Artificial sequence Primer 22 dtttcaaacc tgggatgcag c 21 23 21 DNA Artificial sequence Primer 23 dagatgagat cacgccattg c 21 24 21 DNA Artificial sequence Primer 24 datgcctgta aaggaggaag g 21 25 21 DNA Artificial sequence Primer 25 daaagtgggt caagtgaacg g 21 26 21 DNA Artificial sequence Primer 26 dtaagctcca tccacccctg g 21 27 21 DNA Artificial sequence Primer 27 daactggacg catggggttg g 21 28 21 DNA Artificial sequence Primer 28 datgggatcc aggcatcctg c 21 29 21 DNA Artificial sequence Primer 29 dtttggacag ggtgtggagg g 21 30 61 DNA Homo sapiens 30 ccccaatcag gtgttccgaa catctctgcg rrgactgacc ctcctcagcc caggtgctcc 60 y 61 31 61 DNA Homo sapiens 31 cccaatcagg tgttccgaac atctctgcgr rgactgaccc tcctcagccc aggtgctccy 60 a 61 32 61 DNA Homo sapiens 32 rrgactgacc ctcctcagcc caggtgctcc yatgggactg gctacacttc ttgactcagt 60 t 61 33 61 DNA Homo sapiens 33 gtagacgatc aagggtggaa tctacagtcc rtgrgccctg acttcttgcc ttcrtctcaa 60 a 61 34 61 DNA Homo sapiens 34 gacgatcaag ggtggaatct acagtccrtg rgccctgact tcttgccttc rtctcaaata 60 g 61 35 61 DNA Homo sapiens 35 acagtccrtg rgccctgact tcttgccttc rtctcaaata gactctgcag ccagccatct 60 a 61 36 61 DNA Homo sapiens 36 attaataggt actaaaatct ccaattgcct yatgcctccc ccttctcttt cccactcacc 60 t 61 37 61 DNA Homo sapiens 37 gtgagttacc tctctcagtg ttggttcctc ktctgtgaaa tggggctaat catttgcttt 60 a 61 38 61 DNA Homo sapiens 38 cccagcccca ggaggaggag cctgtctgga yggacgcagc ctgaactgac ccacaaacag 60 a 61 39 61 DNA Homo sapiens 39 taggtttgta aatacttaac tgatggtaaa rtgtcatgaa cccctacccc cgatggatct 60 g 61 40 61 DNA Homo sapiens 40 gctttgtcct caggccaacc tgcaacccaa rgtgggttac accttggccc ccaggcacac 60 a 61 41 61 DNA Homo sapiens 41 ccaggcacac agaccccagc tttacaagga mcccagctcc ttaacacaga tcccagctcc 60 r 61 42 60 DNA Homo sapiens 42 ycccagctcc ttaacacaga tcccagctcc raggaaactc rtccccccca cgttaatcct 60 43 60 DNA Homo sapiens 43 ttaacacaga tcccagctcc raggaaactc rtccccccca cgttaatcct gaccgacttt 60 44 67 DNA Homo sapiens 44 aacacagatc ccagctccra ggaaactcrt cccccccacg ttaatcctga ccgactttgc 60 cacatgg 67 45 66 DNA Homo sapiens 45 aacacagatc ccagctccra ggaaactcrt ccccccacgt taatcctgac cgactttgcc 60 acatgg 66 46 68 DNA Homo sapiens 46 aacacagatc ccagctccra ggaaactcrt ccccccccac gttaatcctg accgactttg 60 ccacatgg 68 47 61 DNA Homo sapiens 47 tctgcaccat gtcccccacc caatgtgtcc wgaaagccat ttctggtgag ccagatgcac 60 c 61 48 61 DNA Homo sapiens 48 catttctggt gagccagatg caccttctgc rtcccctgaa ttcctgtccc caaccccatg 60 c 61 49 61 DNA Homo sapiens 49 tccacctatc cgcctctagg accttggctc yaactctatt gtactcgtct cctccctccc 60 a 61 50 61 DNA Homo sapiens misc_feature (31)..(31) n is c or no base 50 ctccttgatc taagcctccc agagagaccc ntagaaygtt tccctcaagg acctttctgc 60 c 61 51 61 DNA Homo sapiens 51 gatctaagcc tcccagagag acccctagaa ygtttccctc aaggaccttt ctgcctggaa 60 g 61 52 74 DNA Homo sapiens 52 gaaatagcca cyttctccca aggtttctta tactctrtgg cacatctgac caccagtagc 60 aggcagaatg atgt 74 53 54 DNA Homo sapiens 53 ctcctctact gttacttgga aatagccacy ttctcccaag gtttcttata ctct 54 54 81 DNA Homo sapiens 54 gatcaagtcc tggccatttg acagcagcat ttaaaggcyc tcctctactg ttacttggaa 60 atagccacyt tctcccaagg t 81 55 66 DNA Homo sapiens 55 catgtttaga ccttgggcag ccagggaagc ytactcctgg ggcctcccgg aagccatgga 60 gagaac 66 56 76 DNA Homo sapiens 56 ctcttcactc ctccagtggt taagccagca ggggcaggyg gggaggacac agcagtagaa 60 tcagccaaca gctcat 76 57 70 DNA Homo sapiens 57 aggcctctgg gctccatcca ctgccagttc tggagwggag ctcttcactc ctccagtggt 60 taagccagca 70 58 80 DNA Homo sapiens 58 acattgacca gaaagggatt gaatcaccct tggtccagcr tctggcccct gatctgcagc 60 caatggcagg aatcgaggtc 80 59 80 DNA Homo sapiens 59 tgaatttaga actgttgaaa ctccaagtct ggaatcagca raaatgtatt acattgacca 60 gaaagggatt gaatcaccct 80 60 75 DNA Homo sapiens 60 ctcagagcct ctgcttggct gcaaaggaat tcacccytac tgtagcactt aacccattcc 60 ctcctatcag ggtgg 75 61 76 DNA Homo sapiens 61 ggattgtgct tgtccctgta ggagccccac cccccacccy aggccacctc tcagagcctc 60 tgcttggctg caaagg 76 62 75 DNA Homo sapiens 62 tgagacaggc agggagaggc tgaggcggas gaagttccyg catcccaagg agggcagagt 60 ggattgtgct tgtcc 75 63 76 DNA Homo sapiens 63 gacttaagtc ctgagacagg cagggagagg ctgaggcgga sgaagttccy gcatcccaag 60 gagggcagag tggatt 76 64 78 DNA Homo sapiens 64 gctgagaagg cagagtgccc cmgtgggaaa gaggagtcgc ytccactgga gaagagagag 60 aaagtggagt gtgtggtg 78 65 79 DNA Homo sapiens 65 aacatggctc tcaggtgagg gctgagaagg cagagtgccc cmgtgggaaa gaggagtcgc 60 ytccactgga gaagagaga 79 66 79 DNA Homo sapiens 66 tagatcaaga ggcccagcct gtggcagaac agagctgccr gtggtctctc catcttcaca 60 ctccctgctc tgctggggt 79 67 76 DNA Homo sapiens 67 tctcagcccc ttcctgtggc catttccctc agtgcycaga tgattccctg ggtgagggag 60 acactggggc accctc 76 68 71 DNA Homo sapiens 68 taccccaagg agagttactc gacagtccat aagaagtcaa ctgttgtgtg tgtgcatgcc 60 ttgggcacaa a 71 69 72 DNA Homo sapiens 69 agttcccaat csagtggcaa aatcatcctt cagccttgyg gcagcaagtc cagctcttct 60 ggtcaccctt gc 72 70 74 DNA Homo sapiens 70 ggcaccggct ccttcagcag cagctccagt tcccaatcsa gtggcaaaat catccttcag 60 ccttgyggca gcaa 74 71 67 DNA Homo sapiens 71 ccagcrgttc tagcatttcc agcagckccg gttyacccta ccatccctgc ggcagtgctt 60 cccagag 67 72 69 DNA Homo sapiens 72 ctcgagtccc cagcrgttct agcatttcca gcagckccgg ttyaccctac catccctgcg 60 gcagtgctt 69 73 76 DNA Homo sapiens 73 ccaagggacc ctgctctccc tccagttctc gagtccccag crgttctagc atttccagca 60 gckccggtty acccta 76 74 75 DNA Homo sapiens 74 accccatcat ccccagccag tcggcagctt cctcggccat tgcrttccag ccagtgggga 60 ctggtggggt ccagc 75 75 71 DNA Homo sapiens 75 ccaggcatga cctacagtaa gggtaaaatc taycctgtgg gctacttcac caaagagaac 60 cctgtgaaag g 71 76 81 DNA Homo sapiens 76 cagggacctt ggctaagagc attggcacct tctcagaccy ctgtaaggac cccacgcgta 60 tcacctcccc taacgacccc t 81 77 84 DNA Homo sapiens 77 kactgagata aggcagaaag gtgaggragg aagccaagcc tcyttggccc ttactaacca 60 ctgctttcct ccacagggac cttg 84 78 73 DNA Homo sapiens 78 agaggccgat kactgagata aggcagaaag gtgaggragg aagccaagcc tcyttggccc 60 ttactaacca ctg 73 79 73 DNA Homo sapiens 79 ccctgcgctc tgcttgggag aaacccgaga ggccgatkac tgagataagg cagaaaggtg 60 aggraggaag cca 73 80 74 DNA Homo sapiens 80 tcaatgtatt cctttgaggy cactcacttt ggcacstaat tttctatttt tctggttggt 60 gtttgcccac cctt 74 81 73 DNA Homo sapiens 81 agccccctct tatattcaat gtattccttt gaggycactc actttggcac staattttct 60 atttttctgg ttg 73 82 74 DNA Homo sapiens 82 tcttgaactc tggggcrcat gcaatcctcc cacctcrgcc tcccaaagtg ctgggattac 60 cggcgtgagc cact 74 83 76 DNA Homo sapiens 83 gggtctatgt tgcccaggct ggtcttgaac tctggggcrc atgcaatcct cccacctcrg 60 cctcccaaag tgctgg 76 84 74 DNA Homo sapiens 84 aaaaaaattt taattaaaaa acaaaataca gayrgggtct atgttgccca ggctggtctt 60 gaactctggg gcrc 74 85 74 DNA Homo sapiens 85 aaaaaaattt taattaaaaa acaaaataca gayrgggtct atgttgccca ggctggtctt 60 gaactctggg gcrc 74 86 67 DNA Homo sapiens 86 ctgtctcttc agggtccttt cttttagacc tayttgttcc tgccccttct ccattccctc 60 ttctttt 67 87 72 DNA Homo sapiens 87 ggaggaacca yggggtaagt tgggcctggg gttttsagca aaggaaagga aagataagga 60 aagatgtggc tc 72 88 68 DNA Homo sapiens 88 cagaaggaac gcaggwgaaa gagtcatgga ggaaccaygg ggtaagttgg gcctggggtt 60 ttsagcaa 68 89 75 DNA Homo sapiens 89 ctggaggggc tagggaaggc agaaggaacg caggwgaaag agtcatggag gaaccayggg 60 gtaagttggg cctgg 75 90 164 DNA Homo sapiens 90 aggtgttccg aacatctctg cgrrgactga ccctcctcag cccaggtgct ccyatgggac 60 tggctacact tcttgactca gttttaatct ctccttctct gccttcctgt tgggaatacc 120 ccctcacttc tgtggcttct ttcctgtagt agacgatcaa gggt 164 91 86 DNA Homo sapiens 91 aggtgttccg aacatctctg cgrrgactga ccctcctcag cccaggtgct ccyatgggac 60 tggctacact tcttgactca gtttta 86 92 71 DNA Homo sapiens 92 tctctgcgrr gactgaccct cctcagccca ggtgctccya tgggactggc tacacttctt 60 gactcagttt t 71 93 84 DNA Homo sapiens 93 tcaagggtgg aatctacagt ccrtgrgccc tgacttcttg ccttcrtctc aaatagactc 60 tgcagccagc catctatgca gcgc 84 94 80 DNA Homo sapiens 94 gggtggaatc tacagtccrt grgccctgac ttcttgcctt crtctcaaat agactctgca 60 gccagccatc tatgcagcgc 80 95 82 DNA Homo sapiens 95 atctacagtc crtgrgccct gacttcttgc cttcrtctca aatagactct gcagccagcc 60 atctatgcag cgccccagtg gc 82 96 80 DNA Homo sapiens 96 cctattaata ggtactaaaa tctccaattg cctyatgcct cccccttctc tttcccactc 60 acctacctgc catgtcagcc 80 97 89 DNA Homo sapiens 97 ggcacttgtg atatgacttg cacaggtgag ttacctctct cagtgttggt tcctcktctg 60 tgaaatgggg ctaatcattt gctttattg 89 98 83 DNA Homo sapiens 98 cagccccacc cagccccagc cccaggagga ggagcctgtc tggayggacg cagcctgaac 60 tgacccacaa acagaccaaa aaa 83 99 114 DNA Homo sapiens 99 accaaaaaag tcactctcaa agagctctcg gtaggtttgt aaatacttaa ctgatggtaa 60 artgtcatga acccctaccc ccgatggatc tgaaccgttc acttgaccca cttt 114 100 81 DNA Homo sapiens 100 cactagcttt gtcctcaggc caacctgcaa cccaargtgg gttacacctt ggcccccagg 60 cacacagacc ccagctttac a 81 101 131 DNA Homo sapiens 101 tcaggccaac ctgcaaccca argtgggtta caccttggcc cccaggcaca cagaccccag 60 ctttacaagg amcccagctc cttaacacag atcccagctc craggaaact cgtccccccc 120 acgttaatcc t 131 102 118 DNA Homo sapiens 102 tcacagaccc cagctttaca aggamcccag ctccttaaca cagatcccag ctccraggaa 60 actcrtcccc cccacgttaa tcctgaccga ctttgccaca tggagccagc aaaccatt 118 103 60 DNA Homo sapiens 103 ttaacacaga tcccagctcc raggaaactc rtccccccca cgttaatcct gaccgacttt 60 104 68 DNA Homo sapiens 104 aacacagatc ccagctccra ggaaactcrt ccccccccac gttaatcctg accgactttg 60 ccacatgg 68 105 97 DNA Homo sapiens 105 agccaaatgc accttctgca ccatgtcccc cacccaatgt gtccwgaaag ccatttctgg 60 tgagccagat gcaccttctg crtcccctga attcctg 97 106 103 DNA Homo sapiens 106 gcaccatgtc ccccacccaa tgtgtccwga aagccatttc tggtgagcca gatgcacctt 60 ctgcrtcccc tgaattcctg tccccaaccc catgcgtcca gtt 103 107 89 DNA Homo sapiens 107 tcctccctca ggaatccacc tatccgcctc taggaccttg gctcyaactc tattgtactc 60 gtctcctccc tcccattctc cttttggtc 89 108 85 DNA Homo sapiens 108 cctcccattc tccttttggt ctcagctcct tgatctaagc ctcccagaga gacccctaga 60 aygtttccct caaggacctt tctgc 85 109 84 DNA Homo sapiens 109 attctccttt tggtctcagc tccttgatct aagcctccca gagagacccc tagaaygttt 60 ccctcaagga cctttctgcc tgga 84 

1. A method of diagnosing for, or determining susceptibility to, inflammatory disease such as psoriasis, comprising determining the presence of one or more polymorphisms in the SEEK1 gene or protein.
 2. A method according to claim 1, comprising determining the presence of a nucleotide substitution, deletion or insertion at any one or more of positions 51814, 51789, 51759, 51570, 51505, 51462, 51265, 51216, 51124, 51078, 51017, 51008, 50920, 50901, 50801, 50049, 49405-49407, 49160, 49133, 49045, 49038, 49017, 48920, 48773, 47938, 47868, 47852, 47826, 47661, 47645, 47567, 47547, 47508, 47507, 47438, 46831, 46806, 46784, 39881, 39880, 39851, 39725, 39722, 39702, 35884, 35732, 27006, 26915, 26770, 26724, 26694 26684, 26675-26682, 26576, 26539, 25534, 25458 and 25449 of the seek1 gene, as represented by FIG.
 2. 3. A method according to claim 1, said method comprising determining the presence of an amino acid substitution, deletion or insertion at one or more of positions 24, 34, 37, 40 or 133 of the SEEK1 amino acid sequence, as represented by FIG. 3(i), or the presence of a SEEK1 protein fragment having the amino acid sequence as represented by FIG. 3(ii) or 3(iii).
 4. A polynucleotide comprising a nucleic acid sequence encoding the SEEK1 gene, wherein said SEEK1 gene comprises a nucleotide substitution, deletion or insertion at one or more of positions 51814, 51789, 51759, 51570, 51505, 51462, 51265, 51216, 51124, 51078, 51017, 51008, 50920, 50901, 50801, 50049, 49405-49407, 49160, 49133, 49045, 49038, 49017, 48920, 48773, 47938, 47868, 47852, 47826, 47661, 47645, 47567, 47547, 47508, 47507, 47438, 46831, 46806, 46784, 39881, 39880, 39851, 39725, 39722, 39702, 35884, 35732, 27006, 26915, 26770, 26724, 26694 26684, 26675-26682, 26576, 26539, 25534, 25458 and 25449 of the SEEK1 gene, as represented by FIG.
 2. 5. A polynucleotide comprising a nucleic acid sequence encoding a fragment of the SEEK1 gene, wherein said fragment comprises a nucleotide substitution, deletion or insertion according to claim
 4. 6. A polynucleotide comprising a nucleic acid sequence which hybridises under stringent conditions to the polynucleotide of claim 4 or
 5. 7. A polynucleotide according to claim 6, wherein said nucleic acid preferentially hybridises to one allele of one or more of the polymorphisms of claim
 4. 8. A vector comprising a polynucleotide according to any one of claims 4 to
 7. 9. A host cell comprising a polynucleotide or vector according to any one of claims 4 to
 8. 10. A protein comprising the amino acid sequence of FIG. 3(i), or a fragment thereof, having an amino acid substitution, deletion or insertion at one or more of positions 24, 34, 37, 40 or 133 of FIG. 3(i).
 11. A protein fragment having the amino acid sequence of FIG. 3(ii) or 3(iii).
 12. An antibody or antibody fragment which reacts with an antigen of a protein or protein fragment according to claim 10 or
 11. 13. A transgenic non-human animal comprising a polynucleotide sequence according to any one of claims 4 to
 8. 14. Use of a transgenic non-human animal according to claim 13 in screening for agents for use in diagnosis or treatment of individuals having, or being susceptible to, SEEK1 mediated disease.
 15. A method of screening for an agent for use in the prognosis, diagnosis or treatment of individuals having or being susceptible to SEEK1 mediated disease, said method comprising contacting a putative agent and with a polynucleotide or protein according to claims 4 to 11, and monitoring the reaction there between.
 16. A method of screening for an agent according to claim 15, further comprising contacting a putative agent with a SEEK1 polynucleotide as represented by FIG. 2 or SEEK1 protein as represented by FIG. 3(i); and comparing the reaction there with a reaction between the agent and polynucleotide or protein of any one of claims 4, 9 or
 10. 17. A method for diagnosing and treating SEEK1 mediated disease in a subject, comprising (i) determining which allele of one or more polymorphisms are present according to claim 1; introducing into the subject the different allele, wherein either a variant allele according to claims 4, 9 or 10 is determined and/or a variant allele of claim 4, 9 or 10 is introduced into the subject.
 18. Use of an allele of one or more polymorphisms of claims 4 or 9 in the manufacture of a medicament for use in the diagnosis and treatment of SEEK1 mediated disease, wherein the method comprises (i) determining which allele of one or more polymorphisms are present according to claims 1 to 3; and (ii) introducing into the subject the different allele, wherein either a variant allele according to claim 4, 9 or 10 is determined and/or a variant allele of claim 4, 9 or 10 is introduced into the subject.
 19. A kit for use in diagnosis of an individual having, or being susceptible to, SEEK1 mediated disease, said kit comprising an agent for detection of a polynucleotide, protein or fragment thereof according to any one of claims 4 to
 10. 20. A kit according to claim 19 wherein the agent comprises a polynucleotide according to claim 4 to 8, an antibody according to claim 11 or restriction enzymes for digestion of a polynucleotide according to claim 9 or
 10. 